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The  purpose  of  this  thesis  is  to  suggest  an  alternative  to 
the  \-calculus  for  automating  higher-order  logic.   It  is  proposed  that 
higher-order  problems  "be  formulated  using  an  n-sorted  logical  language. 
Such  a  language  is  a  first-order  language  in  which  there  are  a  finite 
number  of  distinct  types  of  individuals.  The  motivation  for  this  can 
be  seen  as  follows.   Let  there  be  two  types  of  objects,  say  type  1 
and  type  2.   Let  a  be  an  object  of  type  1,  b  an  object  of  type  2,  and 
P  a  two-place  relation  symbol  whose  first  argument  is  of  type  1  and 
second  argument  is  of  type  2.   In  a  natural  way,  a  can  be  associated 
with  a  predicate  over  objects  of  type  2  as  follows:  the  associated 
predicate  holds  for  b  if  and  only  if  P(a,  b)  holds.   Then,  if  one 
quantifies  over  a  type  1  variable,  one  is  essentially  quantifying  over 
the  names  of  predicates.   By  adding  more  types,  one  can  extend  this 
idea  to  the  full  hierarchy  of  higher-order  logic. 

The  main  body  of  the  work  consists  of  essentially  three 
sections.   In  the  first  section,  the  two  languages  dealt  with,  the 
X-calculus  and  n-sorted  logic,  are  described,  and  the  relavent  facts 
about  each  are  mentioned.   It  is  also  notes; that  trivial  modifications 
to  the  proofs  of  standard  theorems  on  resolution  make  those  proofs  hold 
for  n-sorted  resolution  as  well.  In  the  second  section,  it  is  shown 
how  to  translate  a  sentence  A  of  a  X-calculus  into  a  well-formed  expression 
B  of  an  associated  n-sorted  logic.   The  translation  preserves  validity, 
or  equivalently,  inconsistency;  that  is,  if  A  is  valid  (in  the  Henkin 
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sense ),  then  B  is  valid.   Finally,  it  is  shown  that  by  adding  a  set  T 
of  axioms,  called  Henkin  axioms,  to  E  one  has  that  A  is  valid  if  and 
only  if  3  U  T  is  valid.   The  Hehkin  axioms  include  among  others,  a  set 
of  axioms  which  are  analogues  of  the  standard  comprehension  axioms  of 
logic.   In  the  last  section,  it  is  shown  that  the  Henkin  axioms  can  be 
put  into  a  special  form  which  is  compatible  with  the  structure  of  data 
used  by  resolution  theorem-provers .  A  new  rule  of  inference,  called 
naming,  is  proposed  to  eliminate  the  need  for  actually  storing  the  Henkin 
axioms  in  memory. 

The  work  contains  an  appendix  with  some  examples  of  higher- 
order  problems  solved  using  an  n-sorted  representation  and  the  resolution 
and  naming  rules. 
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1.   INTRODUCTION 

1.1  Historical  Background 

In  this  section,  a  brief  history  of  automatic  theorem-proving  is 
given . 

One  approach  to  the  general  topic  of  artificial  intelligence  is  to 
use  the  language  of  mathematical  logic  to  express  one's  problems.   One  can 
then  apply  the  we 11 -developed  theories  of  mathematical  logic  to  obtain 
algorithms  for  actually  solving  problems.   These  problems  are  almost  always 
of  the  form,  "Determine  if  a  particular  statement  S  of  the  language  is  always 
true". 

For  one  of  the  basic  languages  of  mathematical  logic  called  first- 
order  logic  (see  Section  2.3  for  a  precise  definition  of  first-order  logic), 
the  property  of  a  statement  S  always  being  true  is  equivalent  (Church  [2]) 
to  S  being  a  formal  theorem  (again  see  Chapter  2  for  a  precise  definition  of 
formal  theorem) .   Thus,  algorithms  that  handle  such  problems  are  called 
theorem-proving  algorithms  or  theorem-provers,  and  computer  programs  that 
embody  such  algorithms  are  called  automatic  theorem-provers.   Of  course,  it 
is  well  known  that  there  does  not  exist  an  algorithm  which  will  determine 
for  an  arbitrary  first-order  statement  S,  whether  or  not  S  is  a  formal 
theorem  (Church  [2]).   That  is,  first-order  logic  is  undecidable.   However, 
there  do  exist  algorithms  which,  if  given  a  first-order  statement  S  which  is 
always  true,  will  demonstrate  that  S  is  always  true.   Such  an  algorithm  is 
called  a  proof  procedure.   Determining  that  S  is  always  true  is  equivalent 
to  determining  that  the  negation  or  denial  of  S  is  always  false  or 


2 
inconsistent.  Most  algorithms  for  handling  first-order  logic  actually 
demonstrate  that  the  denial  of  S  is  inconsistent  and  are  called  refutation 
procedures.  However,  because  S  is  always  true  if  and  only  if  the  denial  of 
S  is  always  false,  the  two  terms,  proof  procedure  and  refutation  procedure, 
are  used  interchangeably. 

All  recent  proof  procedures  (except  the  simple  minded  procedure 
of  listing  all  formal  proofs,  and  hence,  all  theorems)  are  based  on  a 
fundamental  theorem  of  Herbrand  [ 9]  and  a  method  proposed  by  Skolem  [  19]  in 
1928  (see  Prawitz  [13]).   Herbrand's  Theorem  showed  that  one  could  demonstrate 
the  inconsistency  of  a  first-order  statement  S  by  generating  a  sequence  S,,  Sp, 
...  of  statements  of  Boolean  algebra.   The  Boolean  statements  are  calculated 
from  S  itself,  and  Herbrand's  Theorem  states  that  S  is  inconsistent  if  and 
only  if  some  S,  is  inconsistent.   However,  Boolean  algebra  is  simpler  than 
first-order  logic  in  that  it  can  be  determined  for  each  Boolean  statement 
S.,  whether  or  not  S.  is  inconsistent;  that  is,  Boolean  algebra  is  decidable. 
Skolem  then  proposed  as  a  proof  procedure  calculating  the  sequence  S,,  S  ,  ... 
specified  by  Herbrand's  Theorem  and  examining  each  S.  for  inconsistency. 

Unfortunately,  this  method  was  of  little  practical  use  at  that  time. 
The  calculations  for  most  problems  were  too  long  to  be  carried  out  by  hand. 
Moreover,  if  the  statement  S  was  not  inconsistent,  no  S.  in  the  sequence 
(which  is  usually  infinite)  is  inconsistent.   Thus,  theoretically,  a  person 
would  continue  calculating  forever.   (This  last  problem  is  still  present, 
today,  of  course.)   In  the  late  1950's,  after  the  development  of  high-speed 
digital  computers,  mathematicians  again  considered  Skolem' s  procedure  for 
automating  first-order  logic.   While  the  problem  of  calculating,  or  in  this 


case  computing,  forever  when  S  was  not  inconsistent  was  still  present, 
researchers  developed  faster  ways  of  generating  and  testing  the  S.'s  (see, 

for  example,  Gilmore  [6]  or  Davis  and  Putnam  [  5] ) ■ in  the  hopes  of  overcoming 
the  magnitude  of  the  calculations.  However,  even  these  programs  required 
times  in  the  neighborhood  of  half  an  hour  on  an  IBM  7090  to  prove  some 
elementary  theorems  from  group  theory.   The  size  of  the  Boolean  expressions 
S.  increased  tremendously.   Thus,  even  the  fastest  methods  for  checking 

inconsistency  were  still  not  fast  enough  to  make  a  practical  automatic 
theor em-pr over . 

Then  in  19&5,  J>  A.  Robinson  [15]  published  a  simple  method 
consisting  of  a  single  operation  called  the  resolution  rule.   The  proof  that 
the  resolution  rule  was  sufficient  to  demonstrate  inconsistency  was  still 
based  on  Herbrand's  Theorem.  However,  the  resolution  rule  operates  on  S 
itself,  thus  eliminating  the  need  to  generate  the  rapidly  expanding  sequence 
of  Boolean  expressions. 

Since  1965,  researchers  have  developed  many  refinements  of  the 
resolution  rule  and  many  strategies  for  applying  these  rules  to  the 
subexpressions  of  S  (see,  for  example,  Andrews  [1],  Loveland  [11],  Luckham  [12], 
Robinson  [l6],  and  Wos,  Robinson,  and  Carson  [21]).   The  problems  of  group 
theory  that  previously  required  several  minutes  to  a  half -hour  on  an  IBM  7090 
can  now  be  solved  in  a  few  seconds  on  an  IBM  3^0. 

There  are  two  directions  being  taken  by  researchers  in  automatic 
theorem-proving,  today.   First,  work  is  being  done  to  develop  special  purpose 
theorem-provers  for  handling  particular  kinds  of  problems.  Work  on  question- 
answering  and  information  retrieval  programs  (see,  for  example,  Green  [7]  or 
Darlington  [3]),  theorem-provers  that  handle  equality  (Robinson  and  Wos  [1^]), 
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and  theorem-provers  incorporating  induction  schemata  fall  in  this  category. 
The  second  direction  is  the  attempt  to  develop  theorem-provers  for  a  class 
of  logical  systems  grouped  together  under  the  heading  higher-order  logic 
(Robinson  [IT])-   It  has  been  pointed  out  (Davis  [h])   that  these  higher- 
order  systems  can  be  expressed  in  first-order  logic.  However,  until  now  no 
such  representation  has  been  given  which  would  result  in  first-order  statements 
that  were  not  beyond  the  capabilities  of  any  automatic  theorem-prover  avail- 
able today.   On  the  other  hand,  many  mathematical  problems  can  be  expressed 
very  easily  and  naturally  in  these  higher-order  languages.   It  is  the  pur- 
pose of  this  thesis  to  show  how  higher-order  logic  can  be  expressed,  in  a 
natural  way,  in  a  first-order-like  language  called  n-sorted  logic.   The 
translation  from  higher-order  logic  to  n-sorted  logic  yields  statements  which 
are  of  a  reasonable  size  for  current  theorem-proving  programs.   Finally, 
some  suggestions  as  to  the  possible  implementation  of  a  resolution  style 
automatic  theorem-prover  for  this  representation  of  higher-order  logic  will 
be  made. 
1.2  Outline  of  the  Work 

The  introduction  gave  a  brief  history  of  automatic  theorem-proving. 
The  remainder  of  this  work  is  divided  into  five  sections,  four  more  Chapters 
and  an  Appendix.   Chapter  2  contains  descriptions  of  the  two  languages  to  be 
used--the  A.-calculus  and  n-sorted  logic.   Some  basic  theorems  about  these 
languages  will  be  stated.   Finally,  it  will  be  shown  that  the  resolution 
rule  for  first-order  logic  is  extendable  to  n-sorted  logic.   Chapter  3  contains 
the  major  results  of  this  work.   It  is  first  shown  how  to  translate  the  \- 
calculus  into  an  axiomatic  theory  of  n-sorted  logic.   It  is  then  shown  that 
this  translation  preserves  the  property  of  being  inconsistent.   In  Chapter  h, 


the  naming  rule  is  introduced  as  a  possible  alternative  to  including  certain 
of  the  axioms  given  in  Chapter  3»   Chapter  h   also  contains  some  comments  about 
the  completeness  of  resolution  and  naming  for  higher-order  theorem-proving  in 
n-sorted  logic.   Chapter  5  contains  some  concluding  remarks  and  suggestions 
for  further  research.   Finally,  the  Appendix  contains  examples  of  higher-order 
problems  solved  by  the  method  of  Chapter  k. 


2.   THE  \-CALCULUS  AND  n-SORTED  LOGIC 

2.1  Preliminary  Remarks  on  Defining  a  Logical  System 

This  chapter  will  describe  the  two  logical  systems  to  be  considered, 
the  \-calculus  and  n-sorted  logic,  and  the  relevant  facts  about  each  one. 
In  describing  a  logical  system  it  is  necessary  to  give: 

1.  the  basic  symbols; 

2.  the  way  these  symbols  are  to  be  combined  into  well-formed 
formulas  (hereafter  abbreviated  wffs ) ;  and 

3-   how  the  formulas  are  to  be  interpreted  or  how  they  are  to  be 
manipulated  or  both  depending  on  whether  the  mathematician  is 
interested  in  the  existence  of  models  for  his  formulas  or  in 
rules  of  inference  and  formal  proofs. 

These  two  systems  are  well-known  to  logicians,  and  their  descriptions 
are  given  for  reference  only. 

2.2  The  ^.-Calculus 

This  section  contains  a  description  of  a  logical  system  called  the 
\-calculus.  Most  of  this  material  is  taken  directly  from  Robinson  [18],  and 
hence  the  reference  will  be  made  only  here  at  the  beginning  of  this  section. 

The  symbols  of  this  language  consist  of  type  symbols,  identifiers, 
the  symbol  "A.",  the  symbol  "->",  and  the  open  and  close  parentheses,  "("  and 
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Definition  2.2.1 

The  set  of  types,  usually  denoted  by  T,  is  defined  inductively: 

1.  a  finite  set  of  symbols  including  the  symbol  TRUTHVALUE, 
(abbreviated  as  "TV")  is  given  as  the  set  of  simple  types; 

2.  if  a  and  f3  are  types,  then  (a  -»•  p)  is  also  a  type  and  is  called 
a  complex  type. 

Finally,  a  type  is  either  a  simple  type  or  a  complex  type. 

There  is  an  infinite  number  of  types  for  given  any  two  types,  say 
a  and.  p,  a  third  type  (a  -*   (3)  must  be  present.   Thus  starting  with  the  finite 
set  of  simple  types,  one  can  construct  inductively  an  infinite  set  of  complex 
types  which  are  also  types.   NOTE:  The  language  to  be  described  in  Section 
2-3  also  contains  types.  When  possible  confusion  may  arise,  types  of  the  X- 
calculus  will  be  referred  to  as  \-types. 

Definition  2.2.2 

The  set  of  identifiers  consists  of 

1.  the  special  identifiers  "OR",  "AND",  and  "IMPLIES"  all  of  type 
(TV  -  (TV  ->  TV)),  "NOT"  of  type  (TV  -»  TV),  "TRUE"  and  "FALSE" 
of  type  TV,  for  each  type  a   the  symbol  EQUAL  of  type 

(a  ->  (a  -*  TV)),  and  for  each  type  a  the  symbol  CHOICE  of  type 

((a  ->  TV)  ->  a),  and 

2.  for  each  type  a   infinitely  many  non-special  identifiers  of  type 
a. 

The  symbols  used  for  the  non-special  identifiers  can  be  chosen 
freely  for  their  mnemonic  value  subject  only  to  the  restrictions  that  they 
be  distinct  from  each  other  and  from  the  special  identifiers.  Each  identifier 


8 


is  associated  with  a  type.   Identifiers  of  type  (a  -*  TV)  are  also  called 
predicate  symbols  of  type  a. 

Definition  2.2.3 

A  wff  is  a  sequence  of  symbols  defined  inductively  as  follows: 

1.  an  identifier  of  type  a  is  a  wff  of  type  a,  where  a  is  a  type; 

2.  if  X  is  a  non-special  identifier  of  type  a   and  B  is  a  wff  of 
type  P,  then  (XXB)  is  a  wff  of  type  (a  ->  p),  and  is  called  an 
abstraction; 

3-   if  F  is  a  wff  of  type  (a  -»  p)  and  B  is  a  wff  of  type  a,  then 
(FB)  is  a  wff  of  type  p  and  is  called  an  application. 

As  with  identifiers,  each  wff  has  an  associated  type.  A  wff  of 
type  (a  -*   TV)  is  called  a  predicate  wff  of  type  g,  and  a  wff  of  type  TV  is 
called  a  sentence. 

The  reader  may  have  noticed  that  all  functions  appear  to  be  unary 
when,  in  fact,  not  all  functions  the  mathematician  deals  with  are  unary.   The 
method  of  handling  this  discrepancy  can  be  illustrated  using  the  standard 
"logical  or"  function,  v,  usually  considered  to  be  binary.   Given  two  truth 
values  a  and  b,  a^b  is  another  truth  value.   Suppose  only  one  truth  value, 
say  a,  is  given,  and  write  a^.   If  now  a  second  truth  value,  say  b,  is  supplied, 
the  result  a^b  is  indeed  a  truth  value.  Hence,  the  object  "a^"  produces  a 
truth  value  when  given  a  truth  value;  that  is,  a^  is  itself  a  unary  function 
from  truth  values  to  truth  values.   Thus  the  binary  function  can  be  considered 
as  two  unary  functions,  the  unary  function  v  which  yields  the  second  function 
av  when  applied  to  an  argument  a.   Thus  v  is  of  type  TV  ->  (TV  -»  TV).   In  a 
similar  manner,  any  n-ary  function  can  be  considered  as  a  sequence  of  n 
unary  functions.   This  is  the  point  of  view  taken  in  the  ^.-calculus. 


Example  2.2.1 

Suppose  INTEGER  and  TV  are  the  only  two  simple  types.   Then  the 
set  of  complex  types  includes,  among  others, 

1.  (INTEGER  -»  INTEGER), 

2.  (INTEGER  ->  (INTEGER  -»  INTEGER)),  and 
3-   (INTEGER  -»  TV). 

Suppose  the  non-special  identifiers  include  0,  1,  2,  ...  all  of  type  INTEGER, 
S  and  UMINUS  (denoting  unary  minus)  both  of  type  (INTEGER  ->  INTEGER),  PLUS, 
MINUS,  and  TIMES  all  of  type  (INTEGER  ->  (INTEGER  -*  INTEGER)),  and  GREATER - 
THAN-ZERO  of  type  (INTEGER  ->  TV).   Then  the  following  are  some  examples  of 
wf f s  and  their  types : 

1.  (UMINUS  2)  of  type  INTEGER; 

2.  (MINUS  5)  of  type  (INTEGER  ->  INTEGER); 
3-   ((MENUS  5)  7)  of  type  INTEGER; 

k.      (TIMES  1)  of  type  (INTEGER  -*   INTEGER); 
5-   ((TIMES  1)  1)  of  type  INTEGER; 

6.  (\1(  (TIMES  1)  1))  of  type  (INTEGER  ->  INTEGER); 

7.  (( IMPLIES (GREATER-THAN-ZERO  l) ) (GREATER-THAN-ZERO(Sl) ) )  of 
type  TV. 

In  the  above  example,  the  reader  has  probably  already  guessed  the 
intended  meaning  of  the  identifiers.   The  numbers  0,  1,  2,  ...  were  meant 
to  name  the  integers,  S  the  successor  function,  UMINUS  the  unary  minus 
function,  etc.   Of  course,  one  could  have  some  other  assignment  of  values  in 
mind.   In  general,  it  is  intended  that  identifiers  corresponding  to  a  simple 
type  stand  for  individual  objects  while  identifiers  of  a  complex  type,  say 
(a  ->  p),  stand  for  functions  from  objects  of  type  a  to  objects  of  type  p. 
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An  application  (FB)  stands  for  the  value  of  the  function  F  when  applied  to 

the  argument  B.  An  abstraction,  say  (\XB),  defines  a  new  function  by  listing 

the  argument  of  the  function,  in  this  case  X,  and  the  value  of  the  function 

for  the  argument,  in  this  case  B.  The  special  identifiers  are  meant  to 

stand  for  the  functions  normally  associated  with  them.   That  is,  OR  is  meant 

to  stand  for  disjunction,  AND  for  conjunction,  IMPLIES  for  implication,  NOT 

for  negation,  EQUAL  for  equality  between  objects  of  type  a,    and  CHOICE 

a  a 

for  a  choice  function  of  type  a-   Of  course,  these  functions  are  now  to  be 

considered  as  sequences  of  unary  functions  as  described  above  for  disjunction. 

Finally,  TRUE  is  meant  to  stand  for  the  truth  value  "true"  and  FALSE  for  the 

truth  value  "false".   Note  that  the  symbol  "TRUE"  is  an  identifier  of  the  \- 

calculus,  while  "true"  is  the  value  of  that  identifier.   Thus  the  symbol 
"TRUE"  is  used  in  constructing  a  wff,  while  the  value  "true"  may  be  the  value 
of  a  sentence.   Similarly  for  "FALSE"  and  "false",  and  the  other  special 
identifiers  and  their  intended  meanings. 

The  only  function  mentioned  above  which  may  not  be  familiar  to  the 
reader  is  the  choice  function.    Its  definition  is  given  here  for  reference. 


Definition  2.2.^ 

A  choice  function  of  type  q  is  a  function  c  which  takes  as  an 
argument  an  object  s  of  type  (a  -*  TV)  and  yields  as  its  value  an  object  c(s) 
with  the  property  that  s(c(s))  is  true  if  there  are  any  objects  of  type  a 
at  all  for  which  s  has  the  value  true. 

The  above  remarks  about  the  intended  meanings  of  identifiers  will 
now  be  made  precise. 
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Definition  2.2.5 

An  interpretation  or  Henkin  frame  is  a  pair  (g,  D)  satisfying  the 
following  conditions: 

1.  D  is  a  set  of  domains,  one  for  each  type  in  T;  if  a  is  a  type, 
the  domain  associated  with  type  a  is  denoted  by  D  ; 

2.  g  is  a  map  of  identifiers  into  D  such  that  if  X  is  an  identifier 
of  type  a,    g(X)  is  an  object  in  D  ; 

3*   DTRUTHVALUE  =  [tT™>    f^S^> 

k.      D/    _n  consists  of  functions  from  D  into  D„: 

5-   g  maps  the  special  identifiers  onto  their  regular  meanings; 
that  is,  g(NOT)  is  negation,  g(OR)  is  disjunction,  g(AND)  is 
conjunction,  g(lMPLIES)  is  implication,  g(EQUAL  )  is  equality 

between  objects  of  type  a,  and  g(CHOICE  )  is  a  choice  function 

for  objects  of  type  a; 
6.   B  is  closed  under  g;  that  is,  g(A)  is  in  D  for  every  wff  A 
where  g  is  extended  to  the  set  of  all  wffs  inductively  as 
follows : 

A.  g(FB)  is  the  value  when  the  function  g(F)  is  applied 
to  the  argument  g(B); 

B.  g(\XB)  is  that  function  which  yields  g'(B)  when 
applied  to  g'(X)  where  g'  =  g  everywhere  except 
possibly  at  X  and  g'(X)  is  allowed  to  range  over  the 
domain  of  D  associated  with  the  type  of  X. 

The  intent  of  property  6  is  to  ensure  that  the  domain  D  contains 
all  the  objects  that  the  language  is  able  to  "talk  about".   That  is,  given 
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the  usual  interpretation  of  application  and  abstraction,  the  interpretation 
of  every  wff  (not  just  every  identifier)  is  in  D. 

Definition  2.2.6 

A  sentence  A  is  said  to  be  valid  if  g(A)  =  true  for  every  inter- 
pretation (g, D).  A  sentence  A  is  said  to  be  consistent  if  g(A)  =  true  for 
some  interpretation  (g,D);  in  this  case,  (g,D)  is  called  a  model  of  A.   A 
sentence  A  is  inconsistent  if  g(A)  =  false  for  every  interpretation  (g,D). 

Clearly  a  sentence  A  is  valid  if  and  only  if  the  sentence  (NOT  A) 
is  inconsistent. 

Example  2.2.1  (continued) 

Let  (g, D)  be  an  interpretation  of  the  A.-calculus  described  in  this 
example  in  which  DIKTEGER  is  the  set  of  integers,  g(0)  =  0,  g(l)  =  1,  ..., 

g(UMINUS)  is  the  unary  minus  function,  g(PLUS)  is  addition,  g (MINUS)  is 
subtraction,  g (TIMES)  is  multiplication,  g(S)  is  the  successor  function,  and 
g( GREATER -THAN-ZERO)  is  the  predicate  which  is  true  of  an  integer  i  if  and 
only  if  i  is  positive.   Then  the  evaluations  of  the  formulas  listed  above 
are  as  follows : 

1.  g(UMINUS  2)  is  the  Integer  -2; 

2.  g (MINUS  5)  is  the  function  which  yields  5-a  when  applied 
to  an  integer  a; 

3.  g( (MINUS  5)  7)  is  the  integer  -2; 

k.      g( TIMES  1)  is  the  function  which  yields  l*a  when  applied 

to  an  integer  a; 
5-   g( (TIMES  1)  1)  is  the  integer  1; 
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6.  g(\l( (TIMES  l)  l))  is  the  square  function;  to  see  this,  note 
that  g(\l( (TIMES  l)  l))  is  that  function  f  which  yields 

g' ((TIMES  1)  l)  when  applied  to  g'(l)  where  g'  =  g  except  at 
1  and  g'(l)  varies  over  the  Integers;  now  g' (TIMES)  is  still 

multiplication,  so  that  g' ((TIMES  l)  1)  is  just  (g'(l))2;  that 

2 
is,  f  applied  to  an  integer  a  yields  a  ; 

7.  g( (IMPLIES (GTZ  1))(GTZ(S1)))  is  true. 

Note  that  the  values  of  the  wffs  in  2,  k,    and  6  are  functions  from  integers 
to  integers.   Because  (g, D)  was  assumed  to  he  an  interpretation,  it  must 
satisfy  condition  6  of  Definition  2.2.5;  thus,  whatever  other  functions  are 
contained  in  D(INIEGER  _>  INTEGER)'  the  °neS  listed  in  2>    ^>    ajld  6  above  must  he 
present.   Formulas  1-5  illustrate  application,  formula  6  illustrates  the 
definition  of  a  new  function  by  abstraction,  and  formula  7  shows  the  evaluation 
of  one  of  the  standard  logical  connectives.   In  formula  6,  the  identifier  1 
appears  to  play  the  role  of  a  variable  while  in  formulas  k,    5,  and  7  it  stands 
for  a  fixed  integer.   It  is  the  case  in  the  \-calculus  that  an  identifier  is 
used  both  as  a  constant  and  as  a  variable.   This  is  not  true  in  n-sorted 
logic,  to  be  described  in  the  next  section,  where  constants  and  variables  are 
represented  by  distinct  symbols.   This  fundamental  difference  in  the  two 
languages  will  arise  often  in  the  theory  to  be  presented  in  Chapters  3  and  k. 

The  next  three  definitions  single  out  wffs  of  particular  forms 
which  will  play  an  important  role  in  the  material  in  Chapters  3  and  k. 

Definition  2.2.7 

An  atom  is  a  wff  in  one  of  the  following  two  forms : 


Ik 


1.  ((EQUAL  A)  B);  or 

2.  (FB)  where  F  is  a  wff  of  type  (a  ■*  TV)  not  of  the  form  (OR  A), 
(AND  A)  or  (IMPLIES  A)  for  some  A. 

Definition  2.2.8 

A  literal  is  an  atom  or  a  wff  of  the  form  (NOT  A)  where  A  is  an 
atom. 

Definition  2.2.9 

A  clause  is  a  wff  which  is  defined  inductively  as  follows: 

1.  a  literal  is  a  clause; 

2.  if  C.  and  C  are  clauses,  then  ((OR  C.  )  C  )  is  also  a  clause. 

The  reader  may  also  have  noticed  the  absence  of  the  universal  and 
existential  quantifiers,  V  and  3.   In  fact,  in  the  \-calculus  these  two 
concepts  are  expressed  as  follows : 

V  XA  is  ((\XA) (CHOICE (\X(NOT  A))))  and  3  XA  is  ( (KKA) (CHOICE (\XA))) 
where  A  is  a  sentence. 

The  first  assertion  will  now  be  proved.   The  second  one  can  be 
proved  in  a  similar  manner.   Suppose  a  Henken  frame  is  given  in  which  V  XA 
is  true.   CHOICE(\X(NOT  A))  is  an  object  of  the  same  type  as  X,  and  so  when 
KXA   is  applied  to  that  object,  the  result  must  be  true.   Conversely,  suppose 
((A.XA)  (CHOICE  (XX  (NOT  A))))  is  true  and  suppose  it  were  possible  that  V  XA 
were  false.   Then  AXA  would  be  false  for  some  object  of  the  type  of  X,  and 
hence  XX(NOT  A)  would  be  true  of  some  such  object.   But,  by  the  definition 
of  CHOICE,  CHOICE(XX(NOT  A))  would  be  such  an  object  (that  is,  an  object 
for  which  \X(NOT  A)  were  true);  hence  (A.XA)  applied  to  CHOICE (\X(NOT  A)) 
would  yield  false,  contrary  to  the  hypothesis.   Thus  V  XA  is  true  if  and  only 
if  ((XXA) (CHOICE (\X(NOT  A))))  is  true. 
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The  language  described  above  may  be  supplemented  by  singling  out 
some  set  of  wffs,  which  are  then  called  axioms,  and  including  rules  of 
inference  for  transforming  one  or  more  formulas  into  new  formulas.   Different 
choices  of  axioms  and  rules  of  inference  lead  to  different  inference  systems. 

Definition  2.2.10 

A  proof  is  a  sequence  of  wffs  such  that  each  wff  is  either  an  axiom  or 
is  derivable  from  previous  wffs  of  the  sequence  by  some  rule  of  inference. 

Definition  2.2.11 

A  theorem  is  a  wff  which  is  the  last  wff  of  a  proof. 

While  the  notions  of  proof  and  theorem  are  natural  ones  for  humans, 
as  mentioned  in  Section  1.1,  almost  all  automatic  theorem-pr overs  (including 
the  one  proposed  in  this  work)  attempt  to  demonstrate  that  a  wff  A  is  valid, 
or  equivalently,  that  (NOT  A)  is  inconsistent.   However,  there  is  an  important 
relation  between  the  concepts  of  validity  and  proof  which  was  given  by  Henkin 
[8]. 

Theorem  2.2.1  (Henkin' s  Completeness  Theorem) 

A  sentence  A  of  the  \-calculus  is  a  theorem  (in  a  suitable  inference 
system)  if  and  only  if  A  is  valid. 

As  a  final  note  on  the  \-calculus  in  this  section  it  should  be 

pointed  out  that  the  domains  H,    0>  need  not  contain  all  functions  from  H  to 

(a  ->  P)  a 

HQ.  A  Henkin  frame  (g.H)  in  which  H/    qn  does  contain  all  functions  from  H 

to  HR  for  every  complex  type  (a  -*  p)  is  called  a  standard  frame.   Obviously,  to 

be  valid  in  all  Henkin  frames  is  a  stronger  condition  than  being  valid  in  all 
standard  frames.   In  fact,  validity  in  all  standard  frames  is  insufficient 
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to  guarantee  provability.   This  explains  the  apparent  inconsistency  between 
Henkin's  Completeness  Theorem  and  Gb'del's  Incompleteness  Theorem  for  Second 
Order  Logic.  GBdel  showed  how  to  construct  a  sentence  J  which,  when  inter- 
preted in  a  standard  frame,  had  the  meaning  that  J  itself  was  not  a  theorem. 
Thus,  if  J  was  a  theorem,  it  would  be  valid  in  all  standard  frames  (see 
Church  [2]),  and  so  by  the  previous  sentence,  J  would  not  be  a  theorem,  a 
contradiction.  Thus  J  is  not  a  theorem,  and  so  it  is  valid  in  all  standard 
frames.   Then  J  is  a  sentence  which  is  valid  in  the  standard  sense,  but  not 
a  theorem.   This  is  Go'del's  Incompleteness  Theorem.   In  constructing  J,  G'ddel 
defined  equality  between  two  objects  A  and  B  as  (v  F( (IMPLIES (FA) ) (FB) ) )-- 
that  is  A  and  B  are  the  same  object  if  for  every  property  F,  if  F  holds  for 
A  it  also  holds  for  B.   This  is  not  an  adequate  definition  of  equality  in 
some  non-standard  interpretations.   In  these  interpretations,  then,  J  will 
not  necessarily  have  the  meaning  that  J  is  not  a  theorem.   In  fact,  J  fails 
to  be  true  in  some  non-standard  interpretation.   Thus,  J  is  not  valid  in  the 
general  or  Henkin  sense  and  Henkin's  Completeness  Theorem  is  not  violated. 
While  it  may  be  argued  philosophically  which  notion  of  validity  to  accept,  it 
seems  clear  that  for  the  purposes  of  automatic  theorem-proving  the  correct 
point  of  view  is  to  take  validity  in  the  Henkin  sense,  since  this  is  the  view 
that  links  validity  with  provability  (see  Robinson  [17]). 

2-3  n-Sorted  Logic 

The  reference  for  this  material  is  Kreisel  and  Krivine  [10], 
Chapter  5-   The  notation  is  slightly  different,  but  the  concepts  are  the  same. 

In  an  n-sorted  logic,  there  are  exactly  n  types  where  n  is  some 
integer;  hence  the  name  n-sorted  logic.  Moreover,  unlike  the  X-calculus,  all 
types  are  simple  types.   When  n  is  1,  the  language  is  called  first-order. 
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Definition  2. 3.1 

An  n- sorted  language,  ¥L,    consists  of  the  following  symbols: 

1.  a  set  of  n  type  symbols; 

2.  for  each  type  i,  an  infinite  set  V  of  variables  of  type  i; 
the  set  of  all  variables  is  denoted  by  V; 

3-   for  each  type  1,    a  (possibly  empty)  set  C  of  constant 

symbols  of  type  i; 

k.      for  each  integer  k  >  1  and  each  k  +  1- tuple  of  types  (i_,  ...  i  , i) 

-L      k 

a  (possibly  empty)  set  of  k-ary  function  symbols; 
5.   for  each  integer  k  >  0  and  each  k-tuple  of  types  (i  ,  ...  i  )  a 
(possibly  empty)  set  of  relation  symbols; 

Definition  2.3.2 

A  term  of  type  i  is  a  sequence  of  symbols  of  K  defined  inductively 
as  follows : 

1.  a  variable  or  constant  symbol  of  type  i  is  a  term  of  type  i; 

2.  if  f  is  a  function  symbol  of  type  (i_,  ...  i^ji)  and  t  ,    ...  t 
are  terms  of  type  i-,,  . «.  i,  respectively,  then  f(t,,  ...  t,  )  is 
a  term  of  type  i. 

The  set  of  terms  of  type  i  is  denoted  by  T  ,  and  the  set  of  all  terms  is  denoted 
by  T. 
Definition  2.3-3 

A  wff  is  a  sequence  of  symbols  of  K  defined  inductively  as  follows: 
1.   if  P  is  a  relation  symbol  of  type  (i, ,  ...  i  )  and  t,,  ...  t 
are  terms  of  type  i  ,  ...  i  respectively,  then 
P(t,,  ...  t.  )  is  a  wff  and  is  called  an  atom; 
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2.  if  A  and  B  are  wffs,  then  so  are  (-»  A),  (A  v  B),  (A  a  B), 
(A  -►  B),  (V  x  A)  and  (3  x  A)  where  x  is  a  variable. 
When  the  relation  symbol  P  in  1  above  is  =,  the  atom  will  be  written  (t  =  t  ) 

to  correspond  to  normal  usage  of  the  equality  symbol. 

Definition  2.3.k 

The  concept  of  a  bound  occurrence  of  a  variable  is  defined  inductively 
as  follows: 

1.  in  an  atom,  no  occurrence  of  a  variable  is  bound; 

2.  an  occurrence  of  a  variable  in  (A  v  B),  (A  ^  B),  (A  -*  B),  and 
(-,  A)  is  bound  if  and  only  if  the  corresponding  occurrence  in 
A  or  in  B  is  bound; 

3-   every  occurrence  of  x  in  (V  x  A)  or  (3  x  A)  is  bound;  an 

occurrence  of  a  variable  distinct  from  x  is  bound  in  (VxA)  or 
(3  x  A)  if  and  only  if  the  corresponding  occurrence  in  A  is 
bound. 

An  occurrence  of  a  variable  that  is  not  a  bound  occurrence  is  called  a  free 

occurrence. 

Definition  2.3. 5 

A  closed  wff  is  a  wff  which  has  no  free  occurrence  of  a  variable. 
An  open  wff  is  a  wff  that  is  not  closed. 

Example  2.3.1 

Suppose  there  are  two  types,  1  and  2.   Let  SQF.T  and  #  be  function 
symbols  of  types  (1,2)  and  (2,2,1)  respectively.   Let  P  and  =  be  relation 
symbols  of  type  (l)  and  (l, l)  respectively.   Suppose  the  variables  of  type  1  are 
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are  x  ,  x  ,  . . . ,  and  the  variables  of  type  2  are  y  ,  y ,  ...  .   Then 

the  following  are  terms  of  the  indicated  types : 

1.  SQRT  (x_)  of  type  2;  and 

2.  *  (SQRT  (x^),  SQRTCx^))  of  type  1. 

The  two  expressions  P(x^)  and  (x  =  ^-(SQRT(x1),SQRT(x_  ) ) )  are  atoms, 

and  the  two  expressions 

Vx1(P(x1)  -  (3v1     (xx  =  *  (yr  Y-l)))) 

and  ((VxP(^))  -*   (x,  =  y_))  are  wffs. 

Intuitively,  the  different  types  of  the  language  are  meant  to 
indicate  distinct  kinds  of  objects.   Variables  are  meant  to  range  over  an 
appropriate  type  of  object  while  constant  symbols  represent  fixed  objects 
of  the  appropriate  type.   Function  symbols  and  relation  symbols  are  meant 
to  stand  for  actual  functions  and  relations;  the  associated  k-  or  k+1- 
tuple  determines  the  type  of  object  allowable  in  each  argument  position, 
and  in  the  case  of  function  symbols,  the  type  of  the  value  of  the  function. 
These  intuitive  remarks  will  now  be  made  clear  as  the  concept  of  a  realization 
of  the  language  K  is  introduced.  A  realization  is  the  analogue  of  an 
interpretation  for  the  \-calculus. 

Definition  2.3.6 

A  realization  of  an  n-sorted  language,  K,  consists  of  a  pair  (h,E) 
satisfying  the  following  conditions: 

1.  E  is  a  set  of  n  non-empty  domains,  E., ,  E^,  ...  E  ,  one  for  each 

7      1   2       n 

type; 
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2.  h(c)  e  E.  if  c  is  a  constant  symbol  of  type  i; 

3-   n(f)  e  E.  x  ...  x  E.   ■♦  E.  if  f  is  a  function  symbol  of  type 
xl        \         x 

(i^,  . . . ,  i^,  1 ) ; 

k.     h(p)  e  E.  x  ...  x  E.   ->  {true,  false]  if  P  is  a  relation 
11        T[ 

symbol  of  type  (i-,  . ..,  i  ) . 

The  following  notation,  used  by  Kreisel  and  Krivine  [10],  has  two 
advantages.   First,  it  allows  rigorous  definition  of  the  meaning  of  the 
quantifiers  without  introducing  names  of  objects  into  the  language  and 
without  resorting  to  intuition.   Second,  it  allows  interpretation  of  open 
formulas . 

Let  E  denote  the  set  of  all  maps  5  of  V  into  E  that  preserve  type, 

i.e.  such  that  5(V  )  <E  E.  •  Any  such  5  can  be  extended  to  a  map  of  T  into 

E  (also  denoted  by  5)  as  follows: 

1.  5(c)  =  h(c)  for  each  constant  symbol  c; 

2.  S(f(|1,  ...,  |k))  =  h(f)(S(^),  ...  8(|k)). 

V 
The  map  h  can  then  be  extended  to  a  map  of  all  wffs  of  K  into  E  .   If  F  is 


an  atom,  say  P(t  ,  ...,  t  ),  then 

h(F)  -  {8  e  EV  I  h(p)(5(t1),  ...,  8(t  ))  =  true}.  Otherwise, 

h(A  v  B)  =  h(A)  U  h(B), 
h(A  a  B)  =  h(A)  fl  h(B), 

h(-iA)  =  (EV  -  h(A))  =  complement  of  h(A), 
h(A  ->  B)  =  (EV  -  h(A))  U  h(B)  (so  that  A  ->  B  is  equivalent 
to  ((-j  A)  v  b)  as  one  normally  would  expect).  For  the  quantifiers,  h(3  x  A) 
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is  the  projection  of  h(A)  along  x,  that  is 

h(  3  x  A)  =  {5eE    there  is  some  S'eh(A)  and  6'  =  5  except 
possibly  at  x} . 
Finally, 

h(  V  x  A)  =  {SeE   |  every  5f  such  that  5'  =  8  except 
possibly  at  x  satisfies  5'eh(A)}. 
Again,  one  can  verify  that  h(  :3  x  A)  =  h(  —  V  x  —  A)  as  would  be  expected. 
Kriesel  and  Krivine  [10]  show  that  h(F)  depends  only  on  the  variables  which 
have  free  occurrences  in  F.   In  particular,  if  F  is  closed,  then  h(F)  =  E 
or  h(F)  =  0.   The  property  h(F)  =  E  corresponds  intuitively  to  F  being  true 
in  the  interpretation  (h, E). 
Definition  2. 3. 7 

If  A  is  a  set  of  wffs  of  K  and  (h, E)  is  a  realization  of  K  such  that 
h(F)  =  E  for  every  wff  F  in  A,  then  (h, E)  is  called  a  model  of  A.   A  model 
of  A  in  which  the  binary  relation  symbol  -     '        is  mapped  onto  equality 
between  objects  of  type  a  and  objects  of  type  P  is  called  a  normal  model. 
If  every  realization  of  K  is  a  model  of  A,  then  A  is  said  to  be  valid.   If 
A  has  at  least  one  model,  then  A  is  called  consistent.   If  A  has  no  model, 
A  is  said  to  be  inconsistent. 
Example  2Q.1  (continued) 

Let  a  realization  (h, E)  for  language  K  in  Example  2.J.1  be  defined 
as  follows:  E,  =  integers;  E  =  real  numbers;  h(SOJlT)  is  the  function  which 
yields  the  real  square  root  when  applied  to  a  non -negative  integer  and  0 
otherwise,  h(*)  the  function  which  yields  the  greatest  integer  <  the 


product  of  two  real  numbers;  h(P)  is  that  relation  which  is  true  of  an 

integer  i  if  and  only  if  i  is  >  0  and  h(=)  that  relation  which  is  true  of 

an  integer  i  and  an  integer  r  if  and  only  if  i  and  r  are  the  same  number. 

Then  for  any  SeE  ,  5(SQPT(x,))  is  the  real  number  which  is  •/©(x..)  if 

5(xx)  >  0  and  is  the  real  number  0  if  bix^)  <   0.   Next  5(*(SQiRT(x1),SQPT(x  ))) 

is  the  integer  S(x  )  if  5^)  >  0,  the  integer  0  if  5(x  )  <  0.  For  the  wffs, 

h(P(x1))  is  the  set  of  all  5eEV  such  that  5^)  >  0.  h(x1  =  *(SQPT(x1),SQPvT(x  _))] 

is  also  the  set  of  all  5eE  such  that  5(x, )  >  0.   For,  if  o(x,)  >  0,  by  the 

above,  6(*(SQF.T(x  ),SQPT(x  )))  is  o(x  ),  while  if  B(x  )  <  0, 

8(*(SQPT(x  ),SQJ?T(x  )))  is  0/  6(x  ).   Finally 

h(  ^x1(P(x1)  ->  (  =Iy1(x1  =  *(ar1,y1)))))  =  EV. 
To  see  this,  consider  an  arbitrary  5.   If  5(x, )  >  0,  then  5eh(P(x  ))  and  so 
5/(E  -h(p(x -,)))•   However,  n/s(x  )  is  a  real  number.   Let  6'  =  5  except  at  y, 
where  S'(y, )  =  v5(x  ).   Then  clearly  5'eh(x  =  *(j-,,j-,)),    and  so  by  the  defini- 
tion of  the  evaluation  of  an  existential  quantifier,  5eh(  3y.(x.  =  My-^y-,))) 
and  so  this  5  is  in  h(p(x  )  -»  (  3  y  (x  =  *(y-,,y-,))))«   Suppose  then  that 
5(x1)  <  0.   Then  S^h(P(x  ))  so  that  8e(EV-h(p(x  ))).   Thus,  every  8  is  in 
h(p(x, )  -*  (  3y  (x]  =  *(y  ,  y  )) ) ) .   By  the  definition  of  the  evaluation  of  a 
universal  quantifier, 

h(  Vx1(P(x1)  -  (  Sy^  =  ^(y1,y1)))))  =  EV. 

The  definition  of  atom  in  an  n-sorted  language  has  already  been 
given.  Definitions  of  literal  and  clause  for  such  languages  (the  analogues 
of  Definitions  2.2.8  and  2.2.9)  will  now  be  given. 
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Definition  2.3.8 

A  literal  is  an  atom  or  a  wff  (  -j  A)  where  A  is  an  atom. 

Definition  2.3.9 

Clause  is  defined  inductively  as  follows: 

1.  a  literal  is  a  clause; 

2.  if  c,  and  c  are  clauses,  then  so  is  (c  v/  cp). 

Definition  2.3. 10 

The  wff  (A  -*■  B)  a  (B  ->  A)  will  be  abbreviated  (A  «>  B). 

Definition  2. 3. 11 

A  unit  is  a  clause  containing  exactly  one  literal. 

The  next  three  theorems  from  Kreisel  and  Krivine  [  10]  are  central 
to  the  results  later. 

Theorem  2.3 »1  (The  Finiteness  or  Compactness  Theorem) 

Let  S  be  a  set  of  closed  wffs.   Then  S  has  a  model  if  and  only  if 
every  finite  subset  of  S  has  a  model. 

Definition  2.3-12 

A  wff  B  is  in  Skolem  normal  form  if  it  is  of  the  form 
VXl  ...  Vxk(F(x1,  ...  xk)) 
where  F(x_L,  ...  x^)  is  a  quantifier  free  wff.  Fix^    ...  x^)  is  called 
the  matrix  of  B. 

Theorem  2.3*2  (The  Uniformity  Theorem) 

A  wff  B  in  Skolem  normal  form  with  matrix  F(x  ,    ...,    x  )  is  in- 
consistent if  and  only  if  there  is  a  finite  set  of  k-tuples  of  terms 
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(t1  ,    ...,    tk  ),  (t^~,    ...,  tk  )  ...  (t1  ,    ...,  tk  )  such  that 

F(t  ,  ...,  t.  )  *  ...  *>   F(t  ,  . ..,  t,  )  is  an  inconsistent  wff  of  the 
propositional  calculus. 

Theorem  2. 3. 5  (The  Skolem  Normal  Form  Theorem) 

There  is  an  effective  procedure  which  will  transform  a  closed  wff 
B  into  a  wff  C  in  Skolem  normal  form  so  that  B  has  a  model  if  and  only  if  C 
has  a  model. 

2.h     Resolution  for  n-Sorted  Logic 

This  section  describes  the  extension  of  the  work  of  Robinson  [ 15] 
to  n-sorted  logic.   The  proofs  of  the  n-sorted  analogues  of  the  theorems  in 
that  paper  are  almost  word  for  word  the  same  as  Robinson's  original  proofs 
for  first-order  resolution.   Therefore,  no  proofs  will  be  given  here. 

Definition  2.^.1 

A  substitution  component  is  a  pair  U/V  where  U  is  a  term  and  V  is 
a  variable  of  the  same  type  as  U. 

Definition  2.JJ-.2 

A  substitution  is  a  set  of  substitution  components 

{\/\f    •••  \/\)   satisfying  V±  J  V  if  i  /  j. 

Definition  2. k.3 

The  application  of  a  substitution  a  to  a  string  of  symbols  E,  denoted 
by  Ea,  is  to  simultaneously  replace  each  occurrence  of  a  variable  V.  by  the 
corresponding  term  U. . 
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The  restriction  that  U  and  V  he  of  the  same  type  for  a  substitu- 
tion component  u/v  guarantees  that  the  application  of  a  substitution  to  a 
wff  yields  another  wff. 

Definition  2.k.k 

Let  a  =  (U^/x^  ...  U/xjT-j/y-j,  ...  Tk/yk)  and 

"X   =  (S-,/y-,,  •••  S  „/y„,W.,/z_,  ...  ¥  /z  ).   Then  the  composition  8\  is 
1'  1       r  .r  1'  1      m'  m  *■ 

1  '  1'      j  '  y    1  '  1'      k  '  k7  1'  1'      m'  m       i     i 

or  T.X   =  y.  for  some  i,  that  component  is  not  included. 

Because  n-sorted  substitutions  are  substitutions,  the  elementary 
facts  corresponding  to  5»5«l-5«5»^  in  [15]  hold.   That  is  (Ect)A.  =  H(oX)   for 
all  strings  E;  if  Ea  =  EX   for  all  strings,  then  a  -  X;    {dk)u   =  cr(\ju)  for 
all  substitutions  a,  X,    and  ju;  and  (A  U  B)cr  =  Ao   U  Bcr. 

Definition  2. ^.5 

The  disagreement  set,  B,  of  a  set  of  "well-formed  expressions,  A,  is 
the  set  of  well-formed  subexpressions  of  expressions  of  A  which  begin  at  the 
first  symbol  position  in  which  not  all  expressions  of  A  agree. 

Definition  2.  ^-.6 

If  a  is  a  substitution  such  that  Ao   is  a  singleton  set,  then  a   is 
called  a  unifier  of  A  and  A  is  said  to  be  unifiable.   Further  if  o   is  such 
that  for  every  unifier  (i  of  A,  u  =  oX   for  some  substitution  X,    then  o   is 
called  a  most  general  unifier,  abbreviated  MGU. 
The  empty  substitution  is  denoted  by  e. 


26 

Definition  2.U.7  (The  Unification  Algorithm) 

1.  set  an  =  e,  k  =  0,  and  go  to  2); 

2.  If  Aa  is  a  singleton,  set  a.  =  a,  and  terminate;  otherwise  go 

to  3.; 
3«   if  the  disagreement  set  of  Aa,  contains  a  variable  x  and  a  term 

t  (necessarily  of  the  same  type  since  the  expressions  of  Aa  are 

all  well"  formed)  such  that  x  does  not  occur  in  t,  then  set  a,  ,  = 

'  k+1 

a  (t/x),  k  =  k  +  1  and  return  to  step  2;  otherwise  terminate. 

Theorem  2A.1  (The  Unification  Theorem) 

If  A  is  unifiable,  then  the  unification  algorithm  halts  in  step  2 
and  cr  is  a  MGU. 

Definition  2.^.8 

Let  C  =  L .  v  C  and  D  =  M  v  D'  he  two  clauses  with  L  and  M  subsets  of 
C  and  D  respectively.   If  L  and  M  are  literals  and  one  is  the  negation  of 
the  other,  then  C  and  D  are  said  to  be  ground  resolvable  with  ground  resolvent 
C  v  D'.   Let  C  and  D  be  C  and  D  with  variable  names  changed  so  that  C  and 

D  have  no  common  variables.   Let  N  be  the  set  of  atoms  that  occur  in  L,  and 

M-, .   If  N  is  unifiable,  and  a  is  a  MGU  of  N,  and  if  L  a  and  M^  a     are 

complements,  then  C  and  D  are  said  to  be  generally  resolvable,  or  just 
resolvable,  with  resolvent  C' cr  v  D' a  . 

Theorem  2.^.2  (The  Lifting  Lemma) 

Let  A  and  B  be  two  clauses  with  instances  C  =  P  ^  C1  and  D  =  ~i  p  v  D' 
respectively.   Let  E  be  the  ground  resolvent  C'  v  D*.   Then  A  and  B  are 
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resolvable  and  have  a  resolvent  F  such  that  E  is  an  instance  of  F. 

Theorem  2.K.J>    (The  Resolution  Theorem  for  n-Sorted  Logic) 

Let  Vx  ...  V  x,  (C.  a>»  . .  *•  C  )  be  a  closed  inconsistent  wff  in 

Skolem  Normal  Form  with  matrix  clauses  C,,  Cp,  ...  C  .   Then  the  empty  clause 

can  be  deduced  from  CL,  C  ,    ...,    C  using  only  the  resolution  rule  of  inference. 

Chapter  3  presents  a  certain  equivalence  between  the  ^-calculus 
and  a  corresponding  theory  in  an  n-sorted  system.  Chapter  k   deals  with 
practical  applications  of  this  equivalence  using  the  resolution  rule. 
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3-   EQUIVALENCE  OF  \ -CALCULI  TO  n-SORTED  THEORIES 
In  this  chapter  it  will  be  shown  that  the  \-calculus  is  equivalent 
to  a  theory  in  an  associated  n-sorted  logic. 

3.1  Locally  Henkin  Frames 

Definition  3«1»1 

Let  E  be  a  subset  of  the  set  of  types  in  a  \-calculus.  The  type 
closure  of  E  is  the  smallest  set  of  types  which  contains  E,  the  type  TV,  and 
which  contains  a  and  |3  whenever  it  contains  (a  -*   (3). 

Let  S  be  a  set  of  expressions  of  a  \-calculus.   The  notation  T(S) 
will  denote  the  type  closure  of  the  set  of  types  occurring  in  S. 

Definition  3.1.2 

Let  S  be  a  set  of  expressions  of  a  \ -calculus.   Then  a  local  Henkin 
frame  relative  to  T(s)  consists  of  a  set  of  domains  H  and  a  mapping  g  of 
identifiers  satisfying  the  following: 

1.  H  contains  a  domain  for  each  type  in  T(s)  and  no  others; 

2.  g  is  a  map  of  identifiers  whose  types  are  in  T(S)  into  H 

satisfying  g(x)eH  if  X  is  of  type  a; 
3-   H(TV)  =  {true,  false}; 

k.      H,    ^N  consists  of  functions  from  Ha  to  HP; 

(a  -*  3) 

5.  g  maps  the  special  identifiers  onto  their  regular  meanings; 

6.  H  is  closed  under  g  relative  to  T(S);  that  is,  if  A  is  an 
expression  which  is  built  from  expressions  all  of  whose  types 
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are  in  T(S),  then  g(A)eH  where  g  is  extended  to  the  set  of  all 
formulas  over  T(S)  in  the  same  way  as  in  Definition  2.2.5. 
The  notions  of  local  validity,  local  consistency,  and  local  inconsistency  are 
the  analogues  of  validity,  consistency  and  inconsistency  given  in  Definition 
2.2.6.  Where  the  context  can  cause  no  ambiguity,  the  phrase  "relative  to  T(S)" 
will  be  omitted.  Note  the  similarity  to  the  definition  of  Henkin  frame.  As 
the  name  implies,  a  local  Henkin  frame  merely  restricts  one's  attention  to  a 
limited  set  of  domains.   The  following  theorem  shows  that  it  is  sufficient  for 
demonstrating  inconsistency  to  consider  only  local  Henkin  frames. 

Theorem  3Q«i 

Let  S  be  a  set  of  sentences  of  a  ^.-calculus .   Then 

1.  if  S  is  locally  valid,  S  is  valid,  and 

2.  if  S  is  consistent,  S  is  locally  consistent. 

Proof 

Suppose  S  is  consistent  and  (g,H)  is  a  Henkin  frame  for  which 
g(S)  =  true.   The  restriction  of  (g,H)  to  T(S)  is  a  locally  Henkin  model  of 
S  proving  2.   To  prove  1,  suppose  S  is  not  valid.   Then  (NOT  S)  is  consistent. 
By  2,  (NOT  S)  is  locally  consistent  contradicting  the  assumption  that  S  is 
locally  valid. 

Theorem  3* 1*1  shows  that  for  the  purpose  of  determining  validity 
or  inconsistency,  one  can  just  as  well  restrict  one's  attention  to  locally 
Henkin  frames.   This  is  the  approach  taken  in  the  remainder  of  this  work. 
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3*2  Representation  of  the  ^.-Calculus  in  an  n-Gorted  System 

A  translation  of  the  X-calculus  into  n-sorted  logic  will  now  be 
given. 

Definition  3.2.1 

Let  S  be  a  set  of  A.-calculus  sentences  and  T(s)  the  type  closure 
of  S.   Then  the  associated  n-sorted  language,  K,  contains  the  following 
symbols  and  types : 

1.  for  each  type  oeT(S),  K  contains  a  type,  also  denoted  by  a; 
note  that  while  some  types  of  T(S)  are  complex,  all  types  in 
K  are  simple; 

2.  for  each  type  a  =    (0  -»  TRUTHVALUE)  in  T(S),  one  specialized 

binary  relation  symbol,  p     , 
3-   for  each  type  a  =  (P  ->  5)  in  T(S),  5  ^  TRUTHVALUE,  one 

specialized  binary  function  symbol  f  '  '    '   called  the 
function  associated  with  (p  _>  g). 

k.      for  each  \-calculus  identifier,  X  ,  a  constant  symbol  c 

and  a  variable  v„  both  of  K-type  a. 

5.  for  each  type  a   in  T(S),  the  binary  equality  relation 
„_(a,  a)  it. 

6.  a  set  of  function  symbols  to  be  described  below. 

In  2  and  3  above,  the  types  p  and  5  will  be  K- types  because  they  will  be  in 
T(S),  a  type  closure  in  which  (0  -»  TRUTHVALUE)  or  (p  -»  5)  occurs.   Superscripts, 
when  written,  will  indicate  types.   Where  no  ambiguity  may  arise  superscripts 
will  be  omitted.  We  may  assume  without  loss  of  generality  that  in  any  formula 
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of  a  \- calculus,  no  two  bound  identifiers  are  equal  and  if  X.YB  is  subformula 
of  A,  that  Y  occurs  in  A  only  within  \YB. 

The  following  definitions  are  made  with  respect  to  a  particular 
wff  A,  in  preparation  for  the  actual  translation  of  A. 

Definition  3.2.1 

An  identifier  X  which  occurs  immediately  to  the  right  of  a  X   in  a 
subexpression  (A.XB)  of  A  is  called  a  marked  identifier  of  A. 
Recall  that  (V  XA)  and  (  =3  XA)  are  abbreviations  for  formulas  containing  \, 
so  that  in  each  of  the  above  cases,  X  is  a  marked  identifier. 

Definition  3-2 «2 

The  term  associated  with  a  subexpression  B  of  A  not  of  type  TV  is 
defined  inductively  as  follows: 

1.  the  term  associated  with  an  identifier  X  is  v  if  X  is  a  marked 

A 

identifier,  c  otherwise; 
X 

2.  the  term  associated  with  an  application  (GD)  is  f(t  , t  )  where 

G   D 

t  is  the  term  associated  with  G,  t  is  the  term  associated 

with  D,  and  f  is  the  function  symbol  associated  with  the  type 
of  the  wff  G; 
3-   the  term  associated  with  (XXB)  is  d(u  ,  ...,  u  )  where  d  is  a 

new  function  symbol  (allowed  for  in  6  of  Definition  3*2.1) 

and  u, ,  ...  u  are  the  variables  of  K  associated  with  the 
1'  m 

marked  identifiers  of  (\XB)  other  than  X. 
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Definition  p. 2.$ 

The  translation  t,  of  a  subexpression  B  of  A  of  type  TV  is 

calculated  as  follows : 

1.  if  B  is  an  atom  (FC)  of  A,  t-^FC)  is  P(t  ,t  )  where  t  and  t 

are  the  terms  associated  with  F  and  C  respectively  and  P  is  the 
relation  symbol  of  K,  of  type  ((a  -*  TV),  a)  where  F  is  of  type 

(a  -  TV); 

2.  t,  maps  the  logical  connectives  and  EQUAL  onto  their  analogues 

1  a 

of  K;  thus,  for  example,  t,((QR  B,)  B  )  is  (t-,(B  )  v   t,(B  )) 
and  T1((EQUALa  B1)  Bg)  is  t^  =  (a'a)  t£  . 

Definition  3.2.k 

The  defining  axiom  for  a  subexpression  (A.XB)  of  A  is  one  of  the 
following  two  wffs  of  K: 

1.  yv-V^  ...  Vu  (t,  (B)  +*   P(d(u,,  . ..  u  ),0)  if  B  is  of 
v  X    1       ml  1'      m  '    X 

type  TV; 

2.  V  vx  V  ux  ...  Vum  (tB  =  f(d(uL,  ...  um),  vx))  where  t£  is  the 

term  associated  with  B  and  f  is  the  function  symbol  associated 
with  the  type  of  (\XB)  if  B  is  not  of  type  TV. 
In  both  cases,  d(u.,  ...,  u  )  is  the  term  associated  with  AXB. 

The  translation  of  A  can  now  be  given. 

Definition  3-2-5 

The  translation  t  of  A  is 

t-,(A)  aC  a  ...  AC 
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"where  C  ,  C  ,  . . .  C,  are  the  defining  axioms  for  the  subf ormulas  of  A 

beginning  with  \. 

While  CHOICE  is  used  mainly  to  define  the  quantifiers,  it  can  be 
translated  as 

Vz  Vx  (3  yP(x,y)  -*  (z  =  f  (CHOICE,  x)  -»  P(x,z))). 

As  was  noted  on  page  13  the  X-calculus  and  n-sorted  logic  differ 
in  their  treatment  of  marked  identifiers.   In  the  ?v-calculus,  an  unmarked 
identifier  acts  as  a  constant  while  that  same  identifier  when  marked  plays 
the  role  of  a  variable.   In  n-sorted  logic  there  are  no  such  dual  symbols, 
and  care  must  be  taken  to  use  the  associated  variable  or  constant  as  necessary. 

Example  3«2. 1 

Suppose  A  is  the  formula 

(Vm((Xn((EQUAL(  (TIMES  TWO)  ((SUM  ID)  n)))((TIMES  n)((PLUS  n)  ONE))))  m)) 

The  types  and  their  associated  identifiers  are: 

1.  INTEGER  -  n,  m,  ONE,  TWO, 

2.  (INTEGER  ->  INTEGER)  -  ID, 

3-   (INTEGER  -*  (INTEGER  -»  INTEGER))  -  TIMES,  PLUS,  and 
k.       ((INTEGER  ->  INTEGER)  ->  (INTEGER  -+   INTEGER))  -  SUM. 
The  marked  identifiers  of  A  are  n  and  m.   Let  f  be  the  function  associated 

with  (INTEGER  -*  INTEGER),  f  the  function  associated  with  (INTEGER  -»  (INTEGER 

->  INTEGER)),  and  f  the  function  associated  with  ((INTEGER  -*  INTEGER)  ^ 

(INTEGER  -*  INTEGER)).   Let  P  be  the  relation  symbol  of  type  ((INTEGER  -»  TV), 
INTEGER).   The  term  t  associated  with 


((TIMES  TWO) ((SUM  ID)  n)) 


is 


fx(f2 (TIMES,  TWO),  fx(f5(SUM,  ID),  n)); 


and  the  term  tp  associated  with 


((TIMES  n)((PLUS  n)  ONE)) 


is 


f-  (f0( TIMES,  n),  fn(f_(PLUS,  n),  ONE)) 


.±^2^^^,     »„     -1v-g 


The  symbol  EQUAL  applied  to  the  two  subexpressions  is  translated  by  t,  onto 


(t,  .  t2). 

The  defining  axiom  for  the  \ -expression  is 

(Vn  ((t1  =  t   )**   P(d,n))). 

The  wff  A  has  one  atom,  namely  the  ^.-abstraction  applied  to  m.   Then  t(A)  is 

Tl(A)  a  (yn((t1  =  t2)**  P(d,n))) 


which  is 


(  VmP(d,m))  -  (  Vn((t1  =  t&)   -  P(d,n))) 
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The  only  variables  occurring  in  t(A)  are  variables  associated  with 
marked  identifiers.   Such  variables  occurring  in  a  defining  axiom  are  all 
universally  quantified.  For  a  variable  to  occur  in  t  (a),  the  corresponding 

identifier  must  be  quantified  by  V  or  3.   These  occurrences  of  variables  are 
bound  when  V  or  3  is  translated  by  x., .  All  other  subexpressions  beginning 

with  X   are  assigned  associated  terms  as  in  Definition  3-2.2  before  the 
calculation  of  t,  (A) .   Thus,  all  occurrences  of  variables  in  t(A)  are  bound 

occurrences,  and  t(A)  is  a  closed  wff  of  K. 

It  is  possible  to  treat  the  A.-type  TV  as  another  type  in  K.   This 
is  necessary,  for  example,  if  the  formula  being  translated  includes  an  identifier 
of  type  (TV  ->  a)  distinct  from  the  special  identifiers.   In  this  case,  the 
logical  symbols  of  the  ^.-calculus  are  also  treated  as  constant  symbols,  and 
certain  wffs  in  the  language  of  K,  which  will  be  described  later,  must  be 
added  to  insure  that  these  constant  symbols  are  Interpreted  correctly.  However, 
in  most  of  this  work,  it  is  assumed  unnecessary  to  treat  TV  in  this  way. 

3-3  The  Associated  Realization  of  K 

Throughout  this  section,  all  Henkin  frames  are  considered  to  be 
local. 

Definition  3»3»1 

Let  (g,H)  be  a  (local)  Henkin  frame  for  the  A.-calculus.   The 
associated  realization  (h, M)  of  K  is  defined  as  follows: 

1.  M  =  H  for  each  type  a; 

a        a 

2.  h(cy)  =  g(x)  for  each  constant  symbol  c„  of  K; 
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3«  for  the  relation  symbol  P  of  type  ((a  -*  TV"),  a),  h(P)  holds 
for  a  pair  (a, b)  if  and  only  if  a(b)  holds;  that  is,  for  a 
predicate  a  in  H/  _^   _  ^  and  an  object  b  in  H  ,  h(P)  (a,b)  is 

true  if  and  only  if  the  predicate  a  applied  to  the  object  b  is 

true; 
k.      for  the  function  symbol  f  of  type  ((a  -►  p),  a),  p  ^  TV,  h(f) 

is  the  function  defined  by  the  equation  h(f)  (a,b)  =  a(b); 
5.   for  the  function  d  associated  with  the  wff  >-YB  with  marked 

identifiers  Y,  Y  ,  ...  Y,,  h(d)  is  the  function  defined  by  the 

equation 

h(d)  (suj,  ...,  a^)  =  g'(\YB) 

where  g'  =  g  except  at  Y  ,  ...  Y  where  g'(Y. )  =  a.. 

In  the  following  (g,H)  and  (h,M)  are  as  above. 

Lemma  3»3«1 

Let  A  be  a  formula  of  the  X.-calculus  and  B  a  subformula  of  A.  Let 

t  be  the  term  associated  with  B.   Let  Y, ,  ...  Y  be  the  marked  identifiers  of 

1'      m 

A  and  un,  ...  u  the  associated  variables  of  K.   Let  5  be  in  M  .   If  5(u. )  = 
g(Y±),  i  =  1,  ...  m,  then  S(t)  =  g(B). 

Proof 

By  induction  on  the  construction  of  B  (or  t); 

Case  1 

B  is  an  identifier.   If  B  is  marked,  then  B  is  Y.  for  some  i  and 

t  is  u. .   By  hypothesis,  5(u.)  =  g(Y.)«   If  B  is  not  marked,  then  t  is  c  . 
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By  the  definition  of  h  and  8,  &(c_)  =  h(C_)  =  g(B) 


Case  2 


B  is  \YB ,,  B  not  of  type  TRUTHVALUE.   The  term  associated  with  B 


is  d(un,  ...  u  ).   By  the  definition  of  h(d),  and  the  fact  that  S(u.)  = 
l7      m  '  v  i ' 

g(Y. ),  where  Y.  are  the  marked  variables  of  B, 

8(t)  =  h(d)(g(Y1),  ...  g(Ym))  =  g(\YB1). 


Case  3 


B  is  an  application  (FD)  not  of  type  TV  and  the  terms  t  ,  t 


associated  with  F  and  D  satisfy  5(t  )  =  g(F)  and  5(t  )  =  g(D).   Then  t  is 

t  JJ 

f(tF,  tD)  and  6(t)  =  h(f)(5(tF),  6(tD))  =  h(f ) (g(F),g(D)) .   By  part  k   of 

the  definition  of  M,  h(f)(g(F),  g(D))  =  g(F)(g(D))  =  g(FD)  =  g(B). 

This  completes  the  proof  of  the  lemma.   Clearly  by  the  way  h(d) 
is  defined,  the  result  holds  as  g  is  allowed  to  vary  over  assignments  of 
identifiers . 

Lemma  3-3-2 

Let  A,  B,  Y., ,  ....  Y  ,  u, ,  ....  u  be  as  in  Lemma  3«3«1>  and 

suppose  B  is  of  type  TV.   Then  for  each  6eM  ,  5eh(r  (b))  if  and  only  if 

g'(B)  =  true  where  g'  -  g  except  at  Y,,  ...,    Y  and  g'(Y. )  =  5(u. ). 


Proof 

Suppose  B  is  an  atom  (FD).   Let  t  and  t  be  the  terms  associated 
with  F  and  D  respectively.   By  Lemma  3.3.1,  S(t  )  =  g'(F)  and  8(t  )  =  g'(D). 
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Now,  t1(b)  is  the  n-sorted  atom  P(t„,  t  ).   Suppose  Seh(P(t  ,  t  )).  Then,  by 

3  of  Definition  3.3.1,  5(t  )  (6(t  ))  holds;  that  is  gf(F)(g*(D))  holds,  or 

g'(F)(g'(D))  =  gf(FD)  =  true.  Conversely,  suppose  g'(FD)  =  true.  Then, 
because  gT(F)  =  &(t  )  and  g'(D)  =  B(t  ),  8(tF)(B(tD))  holds.  So,  by  3-  of 

Definition  3-3.1,  h(P)(&(t  ),  8(tD))  holds  and  5€h(P(t  ,t  )). 

Suppose  B  is  ((EQUAL  F. )  F0).   Then  t. (B)  is  (tv     =  t_  )  where  t_ 

12         1        Fi    F2        Fi 

is  the  term  associated  with  F. .   Because  (h, M)  is  a  normal  realization  of  K, 

h(=)  is  equality.   The  proof  for  this  form  of  B  now  proceeds  exactly  as  above. 
Suppose  B  is  ((OR  B, )  Bp),  and  suppose  the  lemma  holds  for  B.  and 

Bg.   ^((OR  B1)  B2)  is  (^(B^  v  T-^Bg)).   Then 

5€h(r1(B1)  v  ^(Bg))  <=>  5eT1(B1)  or  bex^B^ 

j. — <  g,(B1)  =  true  or  g'(Bg)  =  true 

<=^   g'((OR  Bx)  Bg)  =  true. 

The  cases  for  the  other  logical  connectives  and  quantifiers  are  handled 
analogously. 

In  the  above  lemma,  note  that  if  t,(b)  is  a  closed  wff  of  K,  then 

h(x,(B))  is  either  M  or  cp.   If  g(B)  is  true,  then  there  is  at  least  one  5  in 

h(x  (B)),  namely  the  5  such  that  5(u.)  =  g(Y.  ).   Then  h(«r  (B))  is  not  empty 

V 
and  therefore,  must  be  M  . 
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Theorem  3*3«1 

With  (g, H)  and  (h,M)  as  above,  for  every  sentence  A  of  the  ^.-calculus, 
g(A)  =  true  if  and  only  if  1i(t(A))  =  M  . 


Proof 

t(A)  is  a  conjunction 

where  C.  is  the  defining  axiom  for  a  ^.-subexpression  of  A,  say  (A.Y.B.  ).  Recall 
that  t-,  (A)  is  a  closed  wff  of  K,  so  by  the  remark  above,  h(Tn(A))  =  M  if  and 
only  if  g(A)  =  true.   Then  it  is  sufficient  to  prove  the  result  for  each  C. . 
There  are  two  cases. 


Case  1 


B.  is  of  type  TV.   Then  C.  has  the  form 
1  l 


V  vy  V  ux  ...  Vum(T1(B  )«*P(d(u1,  ...,  u  ),  vy  )). 

i  i 

Consider  the  matrix  of  this  wff 

T1(Bi)«-  P(d(u1,  ...,  um),  vy  )  , 

i 

and  let  5  be  in  M   .      Now,    d(un,    ....    u   )   is   the  term  associated  "with   (XY.B.  ) 

'   v  1'    '      my  l  l 

By  Lemma  J. 3.1, 

6(d(u_,  ...,  u  ))  =  g'OY.B.) 
1'       m         li 


where  g'(Z.)  =  S(u.)  for  the  marked  identifiers  Z.  of  KY.B     other  than  Y. 
3  J  3  l  l  l 


itself  and  gf  =  g  elsewhere.   Let  g"  =  g*  except  at  Y.  where  g"(Y. )  =  8(v  ). 

i 
Clearly  g"(\Y±B±)   =  g1  (M±B±)   because  in  the  evaluation  of  each,  the 

particular  value  of  g,(Yi)  or  g"(Yi)  is  immaterial  (see  6  of  Definition  2.2.5). 


Then 


g"(*.Y,B  )  =  8(d(u.,  ...  u  )). 


Furthermore,  gu ((^Y±B±)Y±)   =  g"(Bi).   Now  suppose  8eh(T]L(B]L)).   By  Lemma  3.3-2, 
g"(B.)  must  be  true.   That  is, 


g"((\Y.B.)Y. )  =  true, 
ill 


But  gn((A.Y.B.)Y.)  =  g"(^YiB.)(g"(Y.))  =  b(d(u±f    ...  u  ))(6(ty  )).   Then  by 

i 

the  definition  of  h(p)  (3  of  Definition  3.3-1),  h(p)(8(d(u  ,  ...  u  )),8(v  )) 

i 

is  true,  and  so  Seh(p(d(u  ,  ...  u  ),v  )).   Thus, 

i 

5eh(T1(B.)  -*  P(d(u  ,  ...  u  ),v  )).   If  5^h(T-,(B.)),  then  the  above  also 

i 

holds.   Since  8  was  arbitrary,  5eh(T,  (B. )  ■*  P(d(u,,  ...  u  ),v   )).   Similar 
remarks  show  that  for  any  8  in  M  , 


8eh(p(d(u1,  ...  um),vY>)  ->  r1(Bi)). 


Then,  hO^B^  ~  P(d(ux,  ...  um),vY  ))  =  M  ,  and  by  the  definition  of  the 

i 

evaluation  of  the  universal  quantifier, 

h(C.)  =  MV  also, 
l 


kl 


Case  2 


B.  is  not  of  type  TV.   Then  C.  has  the  form 


VVY.  VV  "•  Vum  ^B.  =  f(d(ul>  '•"'  Um^  VY  ^ 


Again,  consider  the  matrix  of  this  wff, 

tB  =  f(d(u1,  ...,  um),  vy  ) 
i  i 

Let  5  be  in  M  .    If  g'  and  g"  are  defined  as  in  Case  1,  then  it  follows  that 

6(d(un,  ...,  u  ))  =  g'(\Y.B.)  =  g"(\Y.B. ) 

1'     '   m         li         li 


and  g"((^Y.B.)Y.)  =  g"(B.). 


f  is  the  function  symbol  associated  with  the  type  of  (A.Y.B.)*   Therefore,  by  h 
of  Definition  3«3«1> 

h(f)(5(d(Ul,  ...,  um),  B(v  ')))  =  g"(M.Bi)(g"(Yi)) 


=  g"((\Y.B.)Y. ) 

ill 


-6(tB_). 

1 

The  last  equality  follows  from  Lemma  5«5»1«   Tiie  first  and  last  items  in  the 
above  string  of  equalities  state  that 


h(f)(B(d(un,  ...,  u  )),  8(vv  ))  =  5(t_  ) 


Ll> 


Then,  5eh(t   =  f(d(u..,  . ..,  u  ),  v  )).  As  in  Case  1,  5  was  arbitrary,  so 
i  i 

that  h(t_  =  f(d(u  ,  ...,  u  ),  v   ))  =  M  ,  and  hence,  h(C.)  =  M  also.   This 

£5.  -L         mi.  1 

1  1 

completes  the  proof  of  the  theorem. 

In  the  proofs  of  the  two  lemmas  and  the  theorem  it  was  assumed 
that  the  type  TV  never  occurred  as  an  argument  type,  i.e.,  there  were  no 
symbols  of  type  TV  -*  a,  other  than  the  standard  logical  connectives.  If  this 
assumption  is  not  valid,  TV  is  treated  just  as  any  other  type,  the  two-place 
P  relation  symbols  are  replaced  by  three-place  relation  symbols,  and  P(x, y) 
becomes  P(TRUE,x,y),  etc.   Then  the  proofs  of  Lemma  3«3»1  and  3«3«2  and  Theorem 
3»3 •!  still  hold.  Also  note  that,  in  the  proof  of  Lemma  3.3.1,  no  mention  was 
made  of  the  translation  of  CHOICE.   This  is  because  CHOICE  does  not  seem  to 
be  used  except  for  quantifiers,  which  are  handled  separately. 

Theorem  3*3 •!  shows  that  if  A  is  a  sentence  of  the  \-calculus  which 
has  a  Henkin  model,  i.e.  a  Henkin  realization  (g, H)  for  which  g(A)  =  true,  then 
t(A)  has  a  model.   In  particular  one  model  of  t(A)  is  the  realization  of  K 
associated  with  (g,H).   The  converse  is  not  true.   That  is,  there  may  exist  a 
model  (h, M)  of  t(A),  but  the  Henkin  frame  (g,H)  which  one  would  expect  to 
correspond  to  (h, M)  is  not  a  model  of  A  itself.   There  are  two  ways  this 
situation  might  arise: 

1.  the  objects  of  M/    ny   are  not  actual  functions  from  M  into 

(a  -*  P)  a 

MR,  and 

2.  the  pair  (g, H)  which  would  be  expected  to  correspond  to  (h,M) 
does  not  satisfy  closure,  that  is,  there  may  be  a  formula  B  of 
the  \-calculus  for  which  g(B)  ^  H.  These  will  be  illustrated 
in  the  following  example. 


h3 

Example  5«3»1 

Let  the  \-calculus  contain  the  two  types  REAL  and  (REAL  -*  TV), 
and  let  A  be  the  formula 

VX((  IMPLIES  ((EQUAL  X)Y))(V  Z(N0T(ZX)))) 
where  X,  Y  and  Z  are  identifiers.   Then  t(A)  is 

V  VX^YX  =  °Y  "*  V  VZ^  V(kYZ>    VX^' 
The  wff  A  can  have  no  Henkin  model  (g, H)  because  the  predicate  "equal  to  g(Y)" 
which  is  the  interpretation  of  (\x( (EQUAL  X)  Y))  must  be  in  H,         .. 

Clearly  then,  if  g(x)  =  g(Y),  there  is  a  predicate  in  E,      .  „,.>  which 

^KJliAij  ->  1 V  ) 

when  applied  to  g(X)  yields  true,  namely  g(A.X(  (EQUAL  X)y)).   Hence,  g(A) 

must  be  false.   Consider  two  realizations  of  K.   First,  let  (h,M)  be  a 

realization  of  the  associated  n-sorted  logic  such  that  M^-pAT  =  m(-r-fat   w^i  = 

integers,  h(C  )  =  0,  and  let  h(p)  be  the  binary  relation  which  is  identically 
false.   In  this  case  h(r(A))  =  M  but  the  objects  in  M/         x  are  not 

predicates  over  the  objects  in  M_    .   For  the  second  realization,  let 

]VL    =  the  real  numbers,  M,         x  =  {"less  than  0",  "greater  than  0"}, 

and  h(c  )  =  0.   Let  h(p)  be  identically  false.  Again,  h(x(A))  -  M  ,  but 

the  pair  (g,H)  that  one  would  expect  to  correspond  to  (h, M)  is  not  a  Henkin 

frame,  because  M /-.,-,..,.   mTTx  does  not  contain  the  predicate  "equal  0".   In 
(REAL  ->  TVj 

the  realization  (h, M),  this  predicate  corresponds  to  the  open  wff 

Vvz  —,  P(vz,x). 

In  order  to  guarantee  that  models  of  t(A)  will  correspond  to  Henkin 
models  of  A,  a  set  of  axioms  is  added  to  t(a).   This  is  strictly  a  theoretical 


device.   It  has  already  been  pointed  out  by  Robinson  and  Wos  [Ik]   how  the 
addition  of  axioms  can  severely  decrease  the  efficiency  of  theorem-provers . 
Later  it  will  be  suggested  that  new  rules  of  inference  replace  these  axioms 
in  the  same  manner  that  Robinson  and  Wos  replaced  the  equality  axioms  by 
paramodulation.   These  axioms  are  called  the  Henkin  axioms  and  are  denoted 
by  I. 

Definition  3-3-2 

The  set  of  Henkin  axioms  I  consist  of  the  following: 

1.  one  axiom  for  every  formula  \YA  of  the  \-calculus,  namely 
the  defining  axiom  for  \YA  given  in  Definition  3-2.4; 

2.  for  every  type  (a  •*  3),  the  axiom  of  extensionality 

Vx1  Vx2(V  z(f(xrz)  =  f(x2,z))  -  xx  =   x2) 

where  f  is  the  function  symbol  associated  with  (a  -*  3)  or 
V  x±   Vx2(V  z(p(X;l,z)  ~  P(x2,z))  ->  x1  =  x2) 

where  P  is  the  relation  symbol  associated  with  (a  ->  3)  de- 
pending on  whether  or  not  3  =  TV; 
3-   the  TV  axioms,  in  the  case  the  type  TV  is  needed  as  a  separate 
type  of  K;  these  consist  of  the  wff 

VV(V=  CTRUE  "V=  CFALSE) 

and  the  obvious  set  of  wffs  which  define  the  constants  c.^, 

UK 

CAND'  CIMPLIES'  °NOT>  CV>  e3* 

The  defining  axioms  guarantee  the  closure  of  the  corresponding 
Henkin  frame  (g,  H).   Clearly,  an  application  of  a  function  to  an  object  in 
its  domain  does  yield  an  object  in  the  range,  so  that  application  alone  can- 


^5 
not  produce  an  object  outside  the  set  of  domains.   Since  all  the  identifiers 
also  represent  objects  in  the  domain,  the  only  way  that  entirely  new  objects 
could  be  defined  is  through  abstraction.   By  including  t, (\YA),  one  is  in- 
cluding in  the  language  K  the  term  associated  with  X.YA,  whose  interpretation 
will  be  just  the  object  represented  by  \YA.   The  axioms  of  extensionality 
ensure  that  only  one  representation  of  a  particular  function  or  predicate  is 
present.   The  TV  axioms  are  needed  only  when  TV  is  needed  as  a  type  in  K 
(e.g.  when  there  is  a  type  (TV  -*  a),    other  than  the  standard  logical  connec- 
tives, where  arguments  of  type  TV  are  present).   It  will  be  assumed  for  the 
rest  of  this  work  that  these  are  not  needed  and  axioms  of  type  3  are  not  in- 
cluded in  I. 

Lemma  3»3«3 

To  any  normal  model  (h,,N)  of  the  axioms  of  extensionality  there 
corresponds  a  normal  model  (h  ,  M)  of  these  axioms  satisfying  the  following 
conditions : 

1.  there  is  a  1-1  map  g  from  N  onto  M; 

2.  the  objects  of  M,    n\   are  actual  functions  from  M  to 
MR  if  P  /  TV  or  actual  predicates  over  M  if  3  =  TV; 

3-   for  any  wff  A  of  K,  h  (A)  =  g(h  (A))  where  g(S)  is  the  map  in 

MV  such  that  g(6)(x)  =  g(s(x)); 
k.     h^(f)(a,b)  is  a(b)  for  aeM/    oN  and  beM  and  f  the  function 
symbol  associated  with  (a  ->  3);  h(p)(d,  e)  is  true  if  and  only 

if  d(e)  is  true  for  deM/    mTrN  and  eeM  and  P  the  relation 

(a  -*  TVJ       a 

symbol  associated  with  (a  ■*  TV). 


k6 

Tvroof 

The  domains  M  are  defined  by  induction.   If  u   corresponds  to  a 

simple  type  of  the  \-calculus,  i.e.  u   is  not  of  the  form  (a  —   p),  then 

M  =  N  .   That  is,  the  domains  of  individuals  are  the  same  in  M  and  N. 

If  g  is  taken  as  the  identity  map  here,  then  g  is  1-1.  Suppose  M  and 

MQ  have  already  been  given  and  are  in  1-1  correspondence  with  N  and  NQ 
P  oc  p 

respectively  by  g.   Suppose  P  ^  TV.   Let  aeN/   ^  „>,.   Then  for  every  beN 

there  is  exactly  one  ceNR  such  that  c  =  h  (f)(a,b).   Define  g(a)  to  be  that 

function  such  that  g(a)(g(b))  =  g(c)  if  and  only  if  c  =  h,(f)(a,b).   Suppose 

g  as  defined  so  far  is  1-1.   Since  (h,,N)  satisfies  the  axioms  of  extension - 

ality,  given  any  object  aeN/    _>.,  no  other  object  of  N,    0>,  will  have 

[a  -»  p)  {a  •*  p) 

the  same  value  for  every  argument  in  N  .   Therefore,  since  g  is  1-1  between 

N  and  M  and  1-1  between  I\L  and  M0,  two  distinct  objects  a  and  b  in  N/    Qs 
a     a  p     0'  °  (a  -»  P) 

yield  distinct  functions  in  M,    0%.   Of  course,  each  object  of  N/    Q>, 

[a  •*  PJ  '       °  (a  ->  P) 

does  give  rise  to  a  function,  and  thus,  g  is  also  1-1  between  N/     s   and 

M/    „s.   The  analogous  process  is  carried  out  in  the  case  P  =  TV.   By 
la  ->  p) 

finite  induction,  the  result  of  1  is  proved.   Clearly  2  holds  due  to  the 

way  objects  of  M  were  defined.   Finally,  it  remains  to  define  h  for  the 

symbols  c,  P,  and  f.   Define  h„(c)  to  be  g(h,(c)).   hp(P)  is  that  relation 

which  holds  for  a  pair  (d,  e)  in  M,    rm^xM  if  a*"3-  only  if  nn  (p)  holds  for 

[a  —  TV;  a  -L 

(g   (d),g   (e)).   hp(f)  is  that  function  whose  value  for  a  pair  (a,b)  in 
M(a  ^  r)xMq  is  g((h1(f))(g"1(a),g"1(b))). 

Now  let  aeM/  >  a)'      By  the  definition  of  the  objects  in  M/   ^  rn, 
the  function  a  applied  to  an  argument  beM  yields 

g(h1(f))(g-1(a),g-1(b)). 
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This  Is  just  hp(f)(a,b),  and  so  condition  k     for  the  function  symbols  holds. 
Similar  remarks  prove  condition  k-     for  relation  symbols.   There  remains  only 
the  verification  of  property  3  and  this  is  done  by  induction.   Clearly,  if 
A  is  an  atomic  formula,  property  3  holds  by  the  definition  of  h  (P)  and 
hp(f).   Suppose  A  is  the  formula  B  ^  C  and  property  3  holds  for  B  and  for  C. 
Then 

Seh  (A)  if  and  only  if  5eh  (b)  or  6eh  (c) 

if  and  only  if  g(5)eh  (B)  or  g(s)eh_(C)  by  the  induction 

hypothesis 
if  and  only  if  g(s)eh  (A). 
Similar  remarks  prove  3  for  B  *  C,  B  -+  C  and  — >  B.   Suppose  A  is  the  formula 
3  xB.   Then  5eh  (A)  if  and  only  if  there  is  a  8'eh,(B)  such  that  8'  =  8  every- 
where except  possibly  at  x.   By  the  induction  hypothesis  8T€h-,(B)  if  and  only 
if  g(S')£h  (b).   Clearly  g(S')  =  g(&)  everywhere  except  possibly  at  x  so  that 
g(8)ehp(A).   Thus  8eh, (A)  implies  g(&)ehp(A).   An  identical  argument  shows 
oehp(A)  implies  g   (&)eh  (A).   Since  g  is  1-1,  this  gives  the  result.   Finally, 
VxB  is  ->dx-jB  and  the  result  holds  for  -»  and  3  x.   This  proves  the  lemma. 

The  necessity  for  including  the  axioms  of  extensionality  can  be 
seen  by  the  following  example. 
Example  3»3»2 

Suppose  there  are  two  types,  a   and  (a  -»  a).   Let  f  be  the 
function  associated  with  (a  ■*  a)«   Let  =  be  the  only  relation  symbol.   Define 

a  normal  realization  of  K  as  follows:  N  =  N,     N  =  integers; 

a    (a  -»  a) 

h(f)(j,k)  =  k  +  1  for  all  j  and  k.   Thus  each  integer  j  names  the  successor 

function.   Because  all  integers  in  N/     \  name  the  same  function  there  can 

[a  -»•  a) 

be  no  realization  M  where  M,     \  contains  actual  functions  which  is  1-1  with 

{a  -*  a) 


N/     \.   Also,  no  such  realization  can  satisfy  exactly  the  same  closed  wffs 
(a  -  a) 

as  (h,  N).   In  particular,  the  formula 

3  x±   3  x2(V  y(f(Xl,y)  =  f(x2,y))  *  a^  /  xg) 

is  satisfied  by  (h.N).  But  for  any  realization  (h0,M)  for  which  M,  ^ 

<?■  \cl  •*  a.) 

consists  of  actual  functions,  the  above  formula  cannot  be  satisfied  because 
5eh  (V  y(f(x  ,y)  =  f(x  ,y)))  would  require  6(x  )  and  5(x )  to  have  the  same 
value  for  each  argument.   That  is,  S(x  )  and  6(x )  would  be  the  same  func- 
tion, and  so  5(x1)  =  5(x )  and  5eh_(x  =  x  ).   Thus,  distinct  "names"  of 
functions  may  behave  quite  differently  from  each  other.   The  axioms  of 
extensionality  ensure  that  distinct  names  name  distinct  functions,  i.e. 
that  each  function  has  only  one  name. 

The  next  theorem  is  the  major  result  of  this  work.   The  notation 
used  above  is  still  in  force. 

Theorem  3«3-2 

Let  A  be  a  sentence  of  the  ^.-calculus.   Then  there  is  a  local  Henkin 
frame  (g,H)  in  which  g(A)  =  true  if  and  only  if  there  is  a  normal  model  of 
t(A)  U  I. 
Proof 

Suppose  (g,H)  is  a  Henkin  frame  in  which  g(A)  is  true.   Let  (h,M) 
be  the  associated  realization  of  K.   By  Theorem  3*3  »1> 

h(T(A))  =  MV, 
that  is,  (h, M)  is  a  model  of  A.   Clearly  (h,M)  is  normal.  Also, 

M(a-P)  =  H(a-P) 
and  so  M,  ^  ^  does  consist  of  actual  functions  or  predicates.   This 
guarantees  that  the  axioms  of  extensionality  are  satisfied.   Finally, 
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recall  that  in  the  proof  of  Theorem  3* 3*1  it  was  shown  that  h(c)  =  M 
for  any  defining  axiom  C  associated  with  a  wff  \YB.   Thus,  (h,M)  satisfies 
all  wffs  in  I  and  so  (h,  M)  is  a  model  of  t(A)  U  I. 

Conversely,  suppose  (h,  M)  is  a  normal  model  of  t(A)  U  I.   (h,  M) 
may  be  assumed  to  satisfy  conditions  2  and  h     of  Lemma  3«3»3«   Define  a 
local  frame  (g, H)  for  the  \-calculus  as  follows: 

1.  H  =  M  for  all  types  under  consideration: 

Hw  =  ttrue>  false] ; 

2.  g(X)  =  h(c  )  for  all  identifiers  X;  g  maps  the  special  iden- 
tifiers onto  their  regular  meanings. 

It  must  be  shown  that  (g,H)  is  a  Henkin  frame,  that  is  conditions  1-6  in 
Definition  3- 1«1  are  satisfied.   Conditions  1-5  are  obviously  satisfied. 
To  show  condition  6,  closure,  it  is  only  necessary  to  show  g(E)eH  for 
formulas  E  of  the  form  (A.YE').   For,  recalling  a  previous  remark,  an  appli- 
cation of  a  function  to  an  argument  cannot  produce  an  object  outside  H  if 
the  function  and  argument  are  both  in  H.   By  induction  on  the  construction 
of  the  defining  axiom  for  E  it  will  now  be  shown  that  the  name  d  associated 
with  E  satisfies  h(d)  =  g(E)  so  that  g(E)  is  in  H. 
Case  1 

B  is  a  subformula  of  E  consisting  of  a  sequence  of  applications 
and  containing  no  \'s  or  special  identifiers.   Let  t  be  the  term  associated 


with  B  and  let  Yn ,  ...  Y  be  the  marked  identifiers  and  u, ,  ...  u 


the 


associated  variables  of  K.   Then  8(t)  =  g'(B)  where  g'  =  g  except  possibly 

at  the  marked  identifiers  where  g'(Y. )  =  S(u.).   For  if  B  is  a  marked  iden- 

l       i 

tifier,  the  condition  obviously  holds.   If  B  is  an  unmarked  identifier, 
t  is  c^  and  5(c_)  =  h(c_)  =  g(B)  =  g'(B).   Suppose  B  is  (FD)  and  the 

-O  D  B 
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condition  holds  for  F  and  D.   Then  t  is  f(t„,t  ),  and 

6(t)  =  h(f)(5(tF),5(tD)) 
=  6(tF)(8(tD)) 
=  g'(F)(g'(D)) 
=  g'(FD) 
=  g'(B). 
The  second  equality  follows  from  condition  k     of  Lemma  3«3«3« 
Case  2 

B  is  a  subformula  X.YB .,    B,  not  of  type  TV,  and  suppose  B,  contains 
no  \'s  or  special  identifiers.   Then  the  defining  axiom  is 
VVy  Vux  ...  Vum(tB  =  f(d(ur  ...  um),vy)). 

Because  this  axiom  is  satisfied,  it  must  be  that 

8(tB  )  =  5(f(d(ur  ...  um),vy)) 

for  any  5  in  M  .   But,  recall  that 

6(f(d(u1,  ...  um),vY))  =  5(d(u]_,  ...  um))(5(vY)). 

Also,  by  Case  1,  8(t   )  =  g'(Bn)  where  g1  is  the  assignment  of  identifiers 

B1        1 

associated  with  8.   Then, 

8(d(ur  ...  um))(8(vy))  =   g,(B1). 

Thus,  as  8(v  )  varies,  the  function  S(d(u  ,  ...  u  ))  maps  out  the  same  set 
of  values  that  g'  (X.YB, )  does  as  g'(Y)  varies,  i.e. 

&(d(ux,  ...  uj)  =  g'(\YB1). 
Clearly  if  there  are  subformulas  of  B  beginning  with  X.  and  the  above  condition 
holds  for  the  terms  associated  with  these  subformulas,  the  analogous  remarks 
of  Cases  1.  and  2.  hold. 
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Case  3 

B  is  an  application  (FD)  of  type  TV.   Suppose  S(t  )  =  g'(F)  and 
5(tD)  =  g'(D)  for  all  5eM  ,  g'  as  in  Case  1.   t-^B)  is  P(t  ,t  ).   By  condition 
k   of  Lemma  3-3.3,  5eh(P(tF,tD))  if  and  only  if  5(t  )(8(t  ))  is  true,  i.e. 
gT(F)(g'(D))  =  true.  Again,  because  the  special  identifiers  mean  the  same 
in  both  systems,  this  result  holds  when  B  contains  these  as  well. 
Case  k 

B  is  of  form  X.YB, ,  B-,  of  type  TV.   The  defining  axiom  is 
VVy  VU;L  ...  Vum(x1(B1)  -  P(d(u]_,  ...  um),vy)). 

The  proof  of  Case  2  can  be  used  with  minor  modifications  to  show  that 

S(d(u1,  ...  uj)  =  g,(XYB1) 
if  5  and  g'  are  related  as  above. 

In  Case  2  or  k   ,  if  B  is  E  itself,  the  result  is  that 
5(d)  =  g(E),  that  is  h(d)  is  g(E),  thus  showing  g(E)  is  in  H,  and  con- 
dition 6  is  satisfied.   Then  (g,H)  is  a  Henkin  frame.   Clearly  (h, M)  is 
the  realization  of  K  associated  with  (g,H).   By  Theorem  3«3«1,  g(A)  =  true 
if  and  only  if  h(x(A))  =  M  .   But,  it  is  assumed  in  this  case  that 
1i(t(A))  is  M  ,  and  so  g(A)  =  true.   That  is,  (g,H)  is  a  Henkin  model  of  A. 

In  Chapter  k,    some  practical  uses  of  the  material  in  Chapter  3 
are  considered. 
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h.      COMPUTER  IMPLEMENTATION  OF  n-SORTED  LOGIC 
FOR  HIGHER-ORDER  LOGIC 
k.l     Preliminary  Comments 

The  results  of  Chapter  3  show  that  n-sorted  logic  is  an  alternative 
to  the  \-calculus  as  a  vehicle  for  automating  higher-order  logic.  The  ease 
of  altering  existing  first-order  theorem-provers  for  use  with  n-sorted 
languages  and  the  vast  experience  of  researchers  with  first-order  languages 
are  among  the  advantages  of  the  use  of  n-sorted  logic.   Moreover,  the  com- 
pleteness of  most  of  the  strategies  used  for  first-order  theorem-provers  can 
be  proved  by  showing  completeness  for  the  ground  case  and  then  applying  a 
lifting  lemma.   But  the  lifting  lemma  holds  for  n-sorted  logic  as  well,  and 
the  ground  metatheorems  are  the  same  for  both  the  classical  first-order  and 
n-sorted  problems.   Thus,  strategies  whose  first-order  completeness  was  proved 
in  this  way  will  also  be  complete  for  n-sorted  problems.   Thus,  for  example, 
set  of  support,  merging,  subsumption,  linearity,  deletion  of  tautologies,  and 
hyper-resolution  are  all  complete  for  n-sorted  problems  [ see  references  cited 
on  page  3] • 

However,  there  are  some  difficulties  which  arise  when  this  approach 
is  taken  for  automating  higher-order  logic.   It  will  be  the  purpose  of  this 
chapter  to  consider  two  of  these  problems  as  well  as  suggest  a  practical  im- 
plementation of  this  method. 

The  first  of  these  problems  has  to  do  with  the  inclusion  of  the 
Henkin  axioms  I  with  the  negation  S  of  the  theorem  to  be  proved.   Robinson 
and  Wos  [1^]  have  considered  this  problem  in  conjunction  with  their  work  on 
problems  which  include  the  equality  axioms.   They  point  out  that  many  resolu- 
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tions  are  usually  required  to  produce  the  effect  of  applying  one  axiom. 
In  addition,  the  intermediate  resolvents  are  added  to  the  memory  even  though 
it  is  only  the  last  resolvent  that  is  germane  to  the  theorem  being  proved. 
A  more  natural  approach  is  to  replace  the  axiom  by  a  new  operational  rule 
whose  effect  is  just  the  application  of  the  axiom.  Robinson  and  Wos  intro- 
duced paramodulation  to  replace  the  equality  axioms;  a  new  rule  of  in- 
ference, called  naming,  will  be  introduced  below  as  a  possible  alternative 
to  the  inclusion  of  the  defining  axioms  of  I. 

Although  the  set  of  defining  axioms  is  infinite,  Theorem  2.3.1 
says  that  if  S  U  I  is  inconsistent,  there  will  be  a  finite  subset  of  I,  say 
I',  such  that  S  U  I'  is  inconsistent.   This  raises  the  second  problem- -while 
it  is  necessary  to  consider  only  some  finite  subset  of  closure  axioms,  the 
mathematician  does  not  know  beforehand  which  finite  subset  will  be  sufficient 
for  his  problem.   One  solution  to  this  would  be  to  generate  expanding  sub- 
sets of  I  (which  is  recursively  enumerable).   But  this  produces  inefficient 
algorithms  analogous  to  those  in  use  before  resolution  was  introduced  which 
generated  initial  segments  of  the  Herbrand  Universe.  Fortunately,  the  method 
below  partially  solves  this  problem.   It  will  be  shown  that,  in  a  sense,  S 
itself  and  its  resolvents  contain  all  the  information  needed  to  determine 
which  axioms  of  I  to  include  in  the  problem. 
k.2     The  Naming  Rule 

As  motivation  for  the  introduction  of  the  naming  rule  and  the 
theorems  in  the  next  section  consider  the  following  example  from  Robinson 
[18]. 
Example  ^.2.1 

Show  that  the  wff 

( (IMPLIES (Qa))(Sa)) 


5h 
is  a  logical  consequence  of  the  two  wffs 

( Vr (( IMPLIES (RP))(RQj))  and  ( (lMPLIES(Pa))(Sa)) ; 
that  is,  for  any  Henkin  frame  (g,H)  in  which  g  maps  the  last  two  wffs  onto 
true,  g  maps  the  first  wff  onto  true.   Call  these  wffs  A,  B,  and  C  respec- 
tively.  There  are  three  types  in  this  problem: 

1.  individuals,  say  type  a  —  a  is  the  only  symbol  of  this  type 
in  the  above  wffs; 

2.  predicates,  that  is,  type  (a  ■*  TV)  —  P,  Q,  and  S  are  of  this 
type; 

3-   predicates  of  predicates,  type  ((a  ->  TV)  ->  TV)  --  P  is  of  this 
type. 
(Of  course,  the  types  of  the  logical  connectives  are  present  as  well  as  the 
symbol  CHOICE/      -i  because  of  the  quantification  over  R.  However,  these 
are  not  translated  into  separate  types  and  symbols  of  the  associated  n-sorted 
language. ) 

First,  it  will  be  shown  that  ( (IMPLIES (Qa)) (Sa))  is  a  logical  con- 
sequence of  the  other  two  wffs.   Let  (g,H)  be  a  Henkin  frame  in  which  g 
assigns  the  value  true  to  the  wffs  B  and  C.   The  wff  C  is  a  wff  of  type  TV 
involving  the  identifier  P.   It  can  be  considered  to  be  expressing  a  property 
of  P.   Formally,  this  property  is  given  by  the  wff 

(\X( (IMPLIES (Xa) ) (Sa) ) ) . 
Call  this  D.   Then  clearly  g(DP)  is  true.   However,  substituting  D  for  R  in 
B,  one  concludes  that  g(DQ)  is  true  also.   But  then 

g((lMPLIES(Qa))(Sa)) 
is  also  true. 

Now,  consider  the  negation  of  the  above  problem  and  its  transla- 
tion into  a  3-sorted  system.   This  negation  asserts  that  B  and  C  hold  while 
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A  fails;  that  is,  B,  C,  and  (Qa)  hold  but  (Sa)  is  false.   The  translation 
yields  the  "wff 

(VvR(F(Vep)  -F(vR,cQ)))  -  (G(Cp,oa)  -G(os,oa)) 

-G(oQ,oa)^-,G(os,oa) 

where  F  and  G  are  3-sorted  relation  symbols  of  types 

((a  •+  TV)  ->  TV,  (a  ->  TV))  and  (a  ->  TV,  a)  respectively.   The  Skolem  normal 

form  with  matrix  in  conjunctive  normal  form  is 

VvE((-,F(Vcp)  -F(yR,cQ))  -  (  -1G(op,ca)vC(Vca)) 

-G(o(i,oa).-.-iG(Vca)). 

Only  one  pair  of  matrix  clauses  is  resolvable,  namely  -j  G(c  ,  c  )  v  G(c  ,c  ) 

and  -iG(cc,c  ).   They  yield  the  resolvent  -|G(c  ,c  )•   In  order  to  get  a 
o  a  .Fa 

refutation,  it  is  necessary  to  include  the  defining  axiom  for  the  \-wff 

(\X( (iMPLIES(Xa) ) (Sa) ) ) . 
This  axiom  is 

Vvx((G(vx,Ca)  ->G(cs,ca))  *F(d,vx)) 

where  d  is  a  new  constant  symbol.   In  clause  form,  this  axiom  is 
(G(vx,ca)  v  F(d,vx))  *  (  ->G(cs,Ca)  v  F(d,vx)) 

-  (  -,F(d,vx)  ^^G(vx,ca)  vG(cg,Ca)). 

The  first  two  clauses  are  equivalent  to 

((G(vx,ca)  ->G(cs,Ca))  -F(d,vx)) 

and  the  third  clause  is  equivalent  to 

(F(d,vx)  -  (G(vx,ca)  -G(cs,ca))). 

adding  these  three  clauses  to  the  negation  of  the  theorem  allows  the  following 
refutation: 
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1.  -»F(vR,cp)  s/  F(vR,  Cq)  --  original  clause 

2.  1  G(c  ,  c  )  v  G(c  ,  c  )  --  original  clause 

ir  a      o  a 

3.  G(c  , c  )  —  original  clause 

fcj,  a 

k,     -i  G(c_,  c  )  —  original  clause 

5.  G(vx>ca)  v  F(d,v  )  —  axiom 

6.  -»G(cg,c  )  v  F(d,v  )  —  axiom 

7-   T  F(d,vx)  v  -|  G(vx,  c&)  v  G(cg,ca)    --  axiom 


8.  G(c  c  )  v-  F(d,c  )      --  from  2  and  5 

o   a  Jr 

9.  F(d,c  )  —  from  k   and  8 

10.  F(d,  cn)  —  from  1  and  9 

11.  -,  G(cn,c  )  v  G(cQ,c  )   --  from  7  and  10 

iqj,  a       b  a 

12.  G(c„, c  )  —  from  3  and  11 

b  a 

13-   empty  clause  —  from  k   and  12 

Note  that  it  took  two  steps  to  apply  the  axiom.   One  would  prefer 

to  deduce  F(d,  c_.)  directly  from  -|G(c_,c  )  s/  G(c_,  c  ).   This  leads  to  the  con- 

Jr  1   a       b  a 

cept  of  naming. 

Definition  1<-.2.1 

Let  C  be  a  clause.   Let  C  be  any  other  clause  such  that  C  and  C* 

have  no  variables  in  common  but  do  have  a  common  instance.   Let  a  be  a  most 

general  substitution  such  that  Ca  =  C'a.   Let  the  variables  of  C'  be 

x,  u., ,  . . .  u  .   The  wf  f 
'      Ir  m 

Vx  V  un  ...  V  u  (C  «*  P(d(u_.  ...  u  ),x)), 
1       m         1'      m  '  * 

where  d  is  a  new  function  symbol,  is  called  a  naming  axiom  for  C.   The 
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naming  rule  states  that  from  C  one  can  deduce 

(P(d(u1,  ...  um),x))a. 

The  unnaming  rule  states  that  from 

D  v  P(d(tn,  ...  t  ),s) 
one  can  deduce  D  v  c"  where 

C"  =  C'(s/x,  tn/u,,  ...  t  /u  ). 

Note  that  a  naming  axiom  is  just  the  defining  axiom  for  some  wff 
\XC  (recall  section  3*3) •   In  this  case,  however,  the  wff  C  is  of  a  special 
form,  namely  a  clause.   As  in  Definition  3*2.^,  the  term  d(u,,  ...  u  )  names 
the  predicate,  defined  by  the  clause  Cf,  being  applied  to  the  variable  x. 
Also  note  that  there  is  more  than  one  naming  axiom  associated  with  a  clause 
C.   In  fact,  there  are  generally  infinitely  many.   Later  it  will  be  suggested 
that  future  research  be  directed  toward  finding  suitable  refinements  of  the 
naming  axioms.   Finally,  the  unnaming  rule  is  completely  equivalent  to  resolu- 
tion with  the  clause 

-iP(d(u1,  ...  um),x)  v  C 
This  corresponds  to  the  implication  *-  of  the  axiom. 

Returning  to  Example  k.2.±,    naming  allows  the  following  shorter 
refutation: 

1.  -  k.    --  these  clauses  are  as  in  Example  ^.2.1 

5.  F(d,  c  )  —  name  2  using 

Vx((  -,G(x,ca)  ^G(cs,ca))  ~F(d,x)) 

6.  F(d,  c  )  --  from  5  and  1 


7.  -»G(cq,  ca)  v  G(cs,ca)    —  unname 

8.  G(cQ, c  )  --  from  7  and  3 

9.  empty  clause  —  from  8  and  h 
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In  the  above  deduction,  a  number  of  other  wffs  could  also  have  been  used  for 
the  naming  axiom.   For  example,  the  wff 

V  x  Vu((  -,G(x,ca)  v  G(u,ca))  -  F(d1(u),x)) 
could  have  been  used.   In  any  realization  (h,  M)  of  K  in  which  M/    _.-v 
consisted  of  actual  predicates,  h(d)  and  h(d,(cq))  would  be  exactly  the  same 
predicate. 
k,3     Another  Look  at  the  Closure  Axioms 

In  Chapter  3,    the  emphasis  was  on  developing  a  relationship  be- 
tween the  ^.-calculus  and  an  n-sorted  language.   The  axioms  introduced  there 
were  used  to  establish  the  relationship.   There  was  no  need  to  consider  their 
particular  form.   The  emphasis  here  is  on  the  development  of  actual  theorem- 
proving  programs.   Thus,  the  form  of  the  logical  statements  involved  is  a 
major  consideration.   One  would  guess  that  the  axioms  of  extensionality 
would  be  used  in  relatively  few  problems.  Even  so,  their  form  is  known  ahead 
of  time,  and  they  can  always  be  included  in  the  problem.  A  similar  remark 
holds  for  the  TV  axioms  in  case  they  are  needed.   In  a  computer  program, 
these  axioms  would  probably  be  handled  in  a  special  way,  separate  from  the 
clauses  of  the  theorem  itself.   This  leaves  the  defining  axioms.   The  diffi- 
culty with  these  is  that  it  is  generally  not  known  beforehand  which  axioms  to 
Include  with  a  particular  theorem.   The  intent  of  this  section  is  to  show 
that  these  axioms  can  be  put  into  a  special  form.   Then,  in  the  next  section 
it  will  be  shown  that  this  special  form  allows  one  to  deduce  from  the  theorem 
itself  which  axioms  to  include. 

Consider  the  defining  axiom  associated  with  a  wff  KXA   of  type 

(a  -*  P),  3  /  TV.   This  axiom  has  the  form 

Vv  Vu  ...  Vu  (t  =  f(d(un,  ...  u  ),vv)). 
Y    1        m         1?      m  '  Y 
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This  axiom,  when  added  to  the  Skolem  normal  form  of  a  theorem  S,  yields 
just  the  equality  unit 

t  =  f(d(ux,  ...  um),  vy). 
This  unit  can  resolve  only  with  a  literal  of  the  form 

-,  (s-j,  =  f(x,s2))  or  -,  (S]_  =  f(d(tp  ...  tm),s2)). 

Suppose  the  function  d  has  not  yet  been  introduced,  that  is,  the  program  is 
contemplating  the  addition  of  the  above  axiom.   Then  only  literals  of  the  form 
~i  (s,  =  f(x,  s  ))  are  relevant  "because,  of  course,  the  function  symbol  d  can- 
not occur.   Thus  in  considering  which  axioms  of  this  form  to  include,  a  pro- 
gram can  look  for  literals  of  the  form  -|  (s]L  =  f(x,  Sp))  during  the  theorem- 
proving  process.   The  appropriate  equality  unit  defining  the  new  function 
symbol  can  be  added  at  that  time.   If  the  theorem-prover  does  this  for  all 
such  literals,  then  all  the  necessary  axioms  of  this  form  will  be  included. 
For,  if  yet  another  axiom  of  this  form  were  added,  it  would  either  not  be 
resolvable  with  any  other  literal  or  it  would  resolve  with  a  literal 
-7  (s  =  f(x,  s  ))  and  an  axiom  already  naming  this  function  would  have  been 
introduced.   These  remarks  prove  the  following  lemma. 
Lemma  k.~5.~L 

The  defining  axioms  for  wffs  of  the  form  \YB  not  of  type 
(a  -»  TV)  can  be  added  to  the  theorem  during  the  theorem-proving  process. 

Finally  there  are  the  axioms  associated  with  formulas  of  the  form 
A.YB  of  type  (a  -»  TV).   It  will  be  shown  that  these  can  be  put  into  a  form 
similar  to  the  form  of  a  naming  axiom  described  in  Section  k.2.      Some  pre- 
liminary results  are  given  first. 
Lemma  k.^>.2 

In  any  wff  of  the  X-calculus  of  type  TV,  it  can  be  assumed  that 
OR,  NOT,  and  V  are  the  only  logical  connectives  that  appear. 
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Troof 

In  the  standard  way  (see  Church  [2])  one  can  express  the  special 
identifiers  AND  and  IMPLIES  in  terms  of  OR  and  NOT  and  the  existential 
quantifier  d  in  terms  of  NOT  and  V.   That  is, 

((AND  A)B)  is  equivalent  to  NOT((0R(N0T  A))(N0T  b)), 
((IMPLIES  A)B)  is  equivalent  to  ((0R(N0T  A))b),  and 
(3XA)  is  equivalent  to  (N0T(V  x(N0T  A))). 
Lemma  ^.3«3 

Let  A  be  a  wff  and  B  a  subwff  of  A  containing  an  identifier  Y. 
Let  Y'  be  an  identifier  not  occurring  in  A,  and  let  B'  be  obtained  from  B 
by  replacing  all  occurrences  of  Y  by  Y'.   Then  B  can  be  replaced  by 
((\Y'B')y)  in  A  without  affecting  the  value  of  A  in  any  Henkin  frame  (g,H). 
Proof 

To  evaluate  g(A),  one  must  first  evaluate  g(B).   But 
g(B)  =  g((\Y'B')Y).   To  see  this,  note  that  g(\Y'B*)  is  that  function  which 
yields  g'(B')  when  applied  to  g'(Y')  where  g'  =  g  for  all  identifiers  ex- 
cept possibly  Y',  and  g'(Y')  is  allowed  to  vary  over  the  appropriate  domain. 
If  g'(Y')  =  g(Y),  then  clearly  g'(B')  =  g(B).   Thus,  the  function  g(XY'B') 
applied  to  g(Y)  yields  g(B).   That  is  g(B)  -  g((\Y'B*)Y). 
Theorem  k. 3.1 

Given  a  wff  \YB  of  type  (a  ->  TV),  there  exists  a  formula  X.YC 
satisfying 

1.  g(VYB)  =  g(^YC)  for  any  Henkin  frame  (g,H),  and 

2.  for  every  subformula  of  the  form  \ZD  of  \YC  of  the  type 

(P  ->  TV),  D  is  either  a  clause,  NOT  applied  to  an  atom,  or 
V  X  applied  to  an  atom. 
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Proof 

By  Lemma  k.3-2,    it  may  be  assumed  that  OR,  NOT,  and  V  are  the  only 
logical  connectives.  Next,  note  that  sequences  of  more  than  one  NOT  may  first 
he  simplified  to  have  at  most  one  application  of  NOT  by  using  the  fact  that 
g(NOT(NOT  F))  =  g(F). 

Suppose  condition  2  fails  for  X.YB.   There  must  be  a  subformula 
\ZD  where  D  fails  to  satisfy  one  of  the  three  conditions.   D  is  of  type  TV 
so  if  D  is  not  a  clause  it  must  be  NOT  D,  or  V  XD,  where  D  is  of  type  TV. 
In  either  case,  if  D  is  a  clause,  the  subformula  D  can  be  replaced  by 
(\Z'D')Z  where  Z'  is  a  new  identifier  not  occurring  in  \YB,  Z,  is  some 
identifier  in  D  ,  and  DJ  is  obtained  from  D,  by  replacing  Z,  by  Z' .   By 
Lemma  k.~5-3,    g(^-YB)  will  not  be  changed.   However,  (\Z'D')Z,  is  an  atom,  so 
that  after  the  replacement,  the  NOT  or  V  is  applied  to  an  atom.   Similar  re- 
placements can  be  made  if  D  is  NOT  D  or  V  XD  . 

By  repeating  the  process  a  sufficient,  but  finite,  number  of  times, 
one  obtains  a  formula  A.YC  satisfying  the  conditions  of  the  theorem. 

Some  properties  will  now  be  established  in  preparation  for  the  final 
theorem  of  this  section.   Consider  the  defining  axiom  of  a  subformula  \ZD 
where  D  is  a  clause.   Because  the  logical  connectives  OR  and  NOT  are  mapped 
onto  the  logical  connectives  ^  and  -ts  of  K,  the  axiom  is 

(Vvz  VUi  ...  Vum(C  ~  Q(d(U;L,  ...  um),vz))) 

where  C  is  a  clause.   The  atoms  of  C  are  P. (t„, t„)  where  (FG)  is  an  atom  of  D. 

1  F'  G 

Similarly,  if  D  is  NOT  D  ,  D  an  atom  (FG),  the  defining  axiom  is 

(Vv  Vun  ...  Vu  ((  -,  P(t_,t_ ))  -  Q(d(u,,  ...  u  ),vj)). 
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If  D  is  V  XD  ,  D  an  atom  (FG),  the  axiom  is 

(Vvz  VU;L  ...  V^m((VvxQ(tF,tG))  -P(d(ur  ...  um),vz))). 

Before  these  axioms  are  added  to  a  set  S  of  clauses  to  be  given  to  a  theorem- 
prover,  they  are  put  in  Skolem  normal  form  and  the  universal  quantifiers  are 
dropped.   The  first  two  axioms  with  quantifiers  deleted  are  of  the  form 
C  **■  a  where  C  is  a  clause  and  a  is  a  unit  whose  first  argument  starts  with 
a  new  function  symbol.   The  third  axiom  is  V  v„  V  u,  ...  Vu  ((Vv  C)  *  a) 
which  is  equivalent  to  (see  Kreisel  and  Krivine  [10]) 

(V  vz  VU;L  ...  Vum  V  vx(a  -»  C))  *  (V  vz  VU;L  ...  Vu^  3  vx(c  -a)). 

In  taking  the  Skolem  normal  form  of  this  formula,  the  variable  v  in  the 
second  conjunct  would  be  replaced  by  a  new  Skolem  function  f(v  ,u , ,  ...  u  ). 
The  result  after  this  replacement  and  the  deletion  of  quantifiers  is  the  wff 
(C  ->  a)*,  obtained  from  (C  ->  a)  by  replacing  vv  by  f(v  ,u.,,  ...  u  ).   However, 
the  problem  is  to  determine  if  S  is  inconsistent.   If  J  is  a  clause  and  I  is 
an  instance  of  J,  then  S  U  {1}  inconsistent  implies  S  U  (J)  inconsistent. 
Then,  in  the  above,  one  can  just  as  well  add  (C  ->  a)  to  S  instead  of 
(C  ->  a)'.   Thus,  for  the  third  axiom  one  can  again  add  the  quantifier  free 
formula  (C  ■»  a)  to  S  as  the  representative  of  that  axiom.   In  all  three  cases, 
then,  the  defining  axiom  gives  rise  to  a  formula  of  the  form  (C  ■»>  a),  C  a 
clause,  to  be  included  with  the  set  of  clauses  given  to  the  theorem-prover. 
The  form  of  this  formula  is  that  of  the  matrix  of  a  naming  axiom. 

The  results  of  this  section  on  the  closure  axioms  are  collected  into 
Theorem  k.3-2 

One  can  take  as  a  set  of  quantifier  free  formulas  of  K  representing 
the  defining  axioms  a  set  of  formulas  each  of  which  is  an  equality  unit 
t  =  f(d(u,,  ...  u  ),x)  or  of  the  form  C  ■«■  P(d(u, ,  ...  u  ),x),  C  a  clause. 
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k-.k     Some  Completeness  Results 
Definition  k.k-.l 

A  procedure  is  called  refutation  complete  or  just  complete  for 
n-sorted  logic  if,  given  a  quantifier  free  set  S  of  formulas  which  is 
inconsistent,  that  procedure  demonstrates  that  S  is  inconsistent  in  a  finite 
number  of  steps.  A  procedure  "which  is  refutation  complete  is  called  a  proof 
procedure. 

Resolution  and  most  of  the  strategies  used  with  resolution  are 
known  to  be  complete  refutation  procedures.   In  this  section  a  proof  procedure 
will  be  mentioned  for  n-sorted  logic  when  used  to  represent  higher-order  logic. 
Given  the  translation  S  of  a  \-calculus  sentence  which  is  inconsistent,  this 
procedure  will  demonstrate  that  S  is  inconsistent  in  a  finite  number  of  steps. 
The  procedure  itself  will  generate  any  defining  axioms  needed.   This  particu- 
lar proof  procedure  is  not  computationally  feasible  and  is  mentioned  for 
theoretical  purposes  only.   It  will  be  suggested  that  alternate  procedures 
based  on  some  refinement  of  the  naming  rule  be  developed  and  investigated. 
Unfortunately,  the  question  of  the  completeness  of  such  procedures  is  still 
open. 

The  basis  for  the  following  is  Corollary  1  of  Slagle  [ 19] •   This 
result  is  stated  here  for  reference. 
Definition  k.k.2 

A  clause  D  subsumes  a  clause  C  if  there  is  a  substitution  cr  such  that 
Da  is  contained  in  C. 

Note  that  it  is  not  necessary  that  Da  equal  C,  but  only  that  every 
literal  of  Da  occur  in  C. 
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'iheorem  4.4.1  (Corollary  1,  Slagle  [19]) 

Let  E  be  a  set  of  clauses  and  let  F  be  a  finite  set  of  ground 
clauses.   If  E  *  (  -» F)  is  inconsistent,  then  there  is  a  finite  set  I  of 
clauses  such  that 

1.  each  clause  of  I  is  in  E  or  is  deducible  by  resolution  from  E; 

2.  each  clause  C  of  F  is  subsumed  by  some  clause  D  of  I; 
3*   I  ^  (  -|  F)  is  inconsistent; 

4.   every  predicate  symbol  occurring  unnegated  (negated)  in  I 
occurs  unnegated  (negated)  in  both  E  and  F;  and 

5-   every  function  symbol  and  constant  symbol  occurring  in  I 
occurs  in  both  E  and  F. 
In  the  above,  -j  F  is  the  negation  of  the  conjunction  of  the  clauses  in  F. 
Some  properties  "will  now  be  developed  in  order  to  apply  Slagle' s  work  to 
higher-order  logic.   Let  A  be  a  sentence  of  the  \-calculus  which  is  locally 
inconsistent.   By  Theorems  3*3*2  and  2.3*1,  there  is  a  finite  set  I  of 
defining  axioms  such  that  t(A)  U  I  U  {axioms  of  extensionality}  is  incon- 
sistent.  Let  S  be  the  set  of  clauses  obtained  from  t(A)  and  the  axioms  of 
extensionality  and  let  I,  be  the  set  of  quantifier  free  formulas  obtained 
from  I.   By  Theorem  4.3*2,  it  can  be  assumed  that  I  contains  only  formulas 
of  the  form 

t  =  f  (d(u, ,  . . .  u  ),x)  and  C  ■♦*  M. 
Lemma  4.4.1 

With  the  notation  above,  there  is  a  finite  set  I'  of  ground 
instances  of  formulas  in  I,  such  that  S  U  I,'  is  inconsistent. 
Proof 

Herbrand's  Theorem  states  that  a  set  T  of  quantifier  free  formulas 
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is  inconsistent  if  and  only  if  there  is  a  finite  set  T'  of  instances  of  for- 
mulas in  T  -which  is  inconsistent  in  the  Boolean  sense.   S  U  I,  is  inconsis- 
tent, so  there  is  a  finite  set  of  Boolean  inconsistent  instances.   Call  this 
set  S'  U  I-I  where  S'  is  the  set  of  ground  instances  of  formulas  in  S  and 
similarly  for  I-I .   But  then  S  U  I '  is  inconsistent  for  it  does  have  a  finite 
set  of  Boolean  inconsistent  instances,  namely  S'  (J  I-I  • 

In  the  following,  I'  -will  be  used  for  -7  F  in  Theorem  k.k.l.      Then 
-j  I'  corresponds  to  the  F  of  that  theorem.   However,  -7  I*  must  first  he  put 
into  the  form  of  a  set  of  clauses,  that  is,  in  conjunctive  normal  form.  Now, 
I'  is  a  conjunction  of  wffs  of  one  of  the  forms 

t  =  f(d(tr  ...  tm),s),   (C-»M),   or   (M-C) 
where  C  is  a  clause  and  M  is  a  unit.   By  elementary  Boolean  algebra,  -|  I'  can 
be  written  as  a  disjunction  of  wffs  of  one  of  the  forms 

-,  (t  =  fCdCt^  ...  tm),s)),   -,(C->M),   or  -,(M->C). 
The  last  two  are  equivalent  to 

(L  a  -jM)  v  ...  v  (L  a-,m)   and  M^-jL  <n  ...  -^  -j  L 

respectively  where  L  ,  ...  L  are  the  literals  of  C.   Thus,  the  disjunctive 
normal  form  contains  a  disjunct  -j  (t  =  f(d(t  ,  ...  t  ),s))  for  each  axiom 
t  =  f(d(t  ,  ...  t  ),s)   in  I'  and  a  set  of  disjuncts  (L  ^  -j  M),  ... 
(L,  /v  -1  M),  (M  a  -j  L  a  ...  *  -1  L,)  for  each  axiom  ( (L,  ^  ...  v  L,  )  «-  M)  in  I' . 

To  see  what  the  conjunctive  normal  form  of  I-I  looks  like,  consider  the  trans- 
formation of 

(a  .a  b)  ss   (c  /v  d) 
into  conjunctive  normal  form.   By  elementary  Boolean  algebra  this  is  equiva- 
lent to 

(a  v  c)  a  (a  \/  d)  *   (b  v  c)  ^  (b  v-  d) . 
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Each  clause  of  the  result  contains  exactly  one  literal  from  each  dis- 
junct of  the  original  formula.  Moreover,  there  is  a  clause  for  each  such 
set  of  literals.   Obviously,  the  above  can  be  extended  to  any  number  of 
disjuncts  and  any  number  of  literals  per  disjunct  with  the  same  result. 
Lemma  k.k.2 

Let  F  be  the  conjunctive  normal  form  of  -i  I' .   If  C  •*  M  is  a  wff 
in  I',  then  F  contains  a  clause  D  which  contains  C  as  a  subclause.   If 

t  =  f(d(t,,  ...  t  ),s)  is  in  I',  then  F  contains  the  literal 
1'      m  1 

-|  (t  =  f(d(t..,  ...  t  ),s))  in  at  least  one  clause. 

Proof 

Let  L, ,  ...  L.  be  the  literals  of  C.   Then  the  disjunctive  normal 

form  of  -j  I'  contains  the  wffs  (L  a  -j  m),  ...  (L,  a-jM)  as  disjuncts.  By 

the  above  remark,  F  contains  a  clause  D  which  contains  the  literals 

L,,  ...  L,  .   That  is,  C  is  a  subclause  of  D.   A  similar  argument  gives  the 

result  for  t  =  f(d(t_,  ...  t  ),s). 

1'      m 

Example  k.k-.l 

Suppose  I'  consists  of  a  finite  number  of  equality  units, 

t.  =  f. (d. (u.  _,  —  u.    ),  S.).   Then  F  is  just  -i  I',  that  is  F  consists 
i    i  i  i,l      l.m.    l  1 

of  the  single  clause 

-»(ti=  fi(di(ui,i^  ■••  ^W)  ^  •" 

"-»(tk=  fk(dk(uk,l>  •••  \,m^'s^' 
Example  K.K.2 

Suppose  I'  consists  of  one  instance  of  each  implication  -»  and  *- 

of  an  axiom  ((L,  s/  1_)  •*  M),  say 

(A  ^  A-)  ■*   C  and  C  -»  (A^  v  A^). 

Then  using  elementary  Boolean  algebra,  -i  I'  is 
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-,  (((Ax  v  A2)  ->  C)  -  (C  -  (Aj_  -  A£))) 

=  -,  ((A1  v  Ag)  ^  C)  v  ^  (C  -  (Aj_  ^  Ap) 

=  ((A1   vA^-,c)v  (C  -  (  n   (Aj_  v  A£))) 

ee   (A]_  A  n  G)  v  (A2  -n  -,  C)  v  (C1  ^  -,A£  a  -,A£). 

Finally,  the  conjunctive  normal  form  F  of  ->  I-!  consists  of  twelve  clauses, 
for  there  are  two  choices  of  literals  from  each  of  the  first  two  disjuncts 
and  three  choices  from  the  last  disjunct.   The  twelve  clauses  are 

1.       (A^i^vc'),    2.       (A1  v  Ag  v  -,Aj_),    3-       (A1vA2v-)A2), 

k.  (A1  v  -,C  v  C»),    5.       (A1  s/  -,  C  v  -,  Aj_),    6.       (A1v-1Cv-,A')} 

7.  (tCv^vC1),      8.       (  -,C  v  Ag  v-iAj_), 

9-  (iCvA2vnAp,      10.       (nCvC),    11.      (nCv^Aj), 

12.  (   -,  C  ^  -,A2). 

Note  that  in  10,  11,  and  12,  the  ->  C  was  not  repeated.  Also,  clauses  h—9 
are  subsumed  by  clauses  10-12.   Thus,  a  smaller,  but  still  suitable,  set  of 
clauses  to  use  for  F  is  1-3,  10-12. 

Theorem  U.3«2  and  Lemma  h.k.2   suggest  the  following  sort  of 
algorithm  for  theorem-proving  in  higher-order  logic.   Starting  with  the 
original  set  S  of  clauses,  one  proceeds  as  in  the  usual  resolution  algorithm, 
except  that  occasionally  a  naming  axiom  is  added  to  the  set  of  clauses.   The 
combination  of  resolution  and  addition  of  naming  axioms  is  made  in  such  a  way 
that  every  possible  naming  axiom  of  each  subclause  of  each  clause  generated 
is  eventually  added  to  the  set  of  clauses.   By  Theorem  k.3 «2,  each  axiom 
needed  for  the  program  to  prove  the  theorem  is  either  an  equality  unit  or 
of  the  form  (C  ■*■  M),  C  a  clause.   Equality  units  have  already  been  discussed. 
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For  the  other  case,  by  Lemma  k.k.2,    one  can  deduce  from  A  a  clause  D  such 
that  for  some  subclause  E  of  D,  E  and  C  have  a  common  instance.  Then,  the 
axiom  (C  •*-  M)  is  a  naming  axiom  for  E,  and  the  above  algorithm  will 
eventually  add  it  to  S.   Thus,  such  an  algorithm  discovers  which  axioms  to 
add  to  the  theorem  and  is  complete. 

One  could  give  an  algorithm  that  guaranteed  that  every  such  naming 
axiom  was  eventually  added  to  S.  However,  such  an  algorithm  would  not  be 
computationally  feasible  because  there  are  infinitely  many  such  axioms  for 
each  clause.  Also,  the  above  algorithm  adds  all  the  clauses  representing 
these  axioms  to  S  rather  than  applies  each  axiom  in  some  single  operation. 
Thus,  such  an  algorithm  is  of  purely  theoretical  interest. 

It  will  now  be  shown  how  certain  applications  of  the  naming  rule 

alleviate  the  second  problem  above,  i.e.  the  addition  of  axioms.   Consider 

the  application  of  a  naming  rule  for  a  clause  C  whose  literals  are 

ij-. ,  ...  j_l  .   .Le'C 

VxVn   ...  V  u  (C  «*  M) 
1        m 

be  the  naming  axiom  used  where  the  literals  of  C'  are  L' ,  ...  JJf    and  let 

a  be  the  substitution  such  that  Ccr  =  Co".   Suppose  C,cr  =  C  so  that 

Co  -   Ccr  =  C  and  C'  is  an  instance  of  C.   The  clauses  associated  with  the 

axiom  are 

(  ->  L'  s/  M)  ...  (  -,  L£  v  M)   and   (  -7  M  v  L^  v  ...  v  IJ ) . 

The  above  algorithm  would  have  added  all  of  these  to  the  set  S.   However, 
the  naming  rule  states  that  one  can  deduce  Mcr  =  M  directly.  Note  that  M 
can  be  deduced  by  resolution  from  (L,  y  . .  .  v  L,  ),  that  is,  C,  and 
(  ->  L'  v  M),  ...  (  -)  V  v  m)  using  a  as  a  unifier  of  L.  and  -|  L! .   Thus, 
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the  naming  rule  is  equivalent  to  a  sequence  of  resolutions.   Now  note  that 
M  subsumes  each  clause  (  -j  L!  ^  M) .   Thus,  having  deduced  M,  the  clauses 
(  -|  L!  v/  m)  can  be  discarded.   That  is,  with  the  application  of  the  naming 
rule  to  C  only  the  clause  -j  M  v  L'  v  ...  v/  JJ   corresponding  to  the  second 
half  (M  -*  (L'  v  ...  v/  IJ ) )  of  the  naming  axiom  has  to  be  added  to  S.  Adding 
this  clause  to  S  is  tantamount  to  saving  a  pattern  of  the  definition  of  the 
new  function  symbol  occurring  in  M  for  future  unnamings. 

The  above  analysis  fails  if  C'ct  ^  C,  for  in  this  case,  the  naming 
rule  yields  Ma.   If  Ma  /  M,  then  Ma  does  not  subsume  (  -|  1*.  ^   M).   (Recall, 
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for  Ma  to  subsume  (  -jL1,  ^  m),  (Mct)\  must  be  a  literal  of  (  -%  L*.  v  M).   In 
this  case  (Ma)X.  must  be  M.   But,  \   cannot  undo  the  effects  of  a,  unless  a 
only  changes  variable  names.   Therefore,  (Mcr)\  cannot  be  M. )  Without  the 
subsumption  property,  the  clauses  (  -j  l!  v  M)  cannot  be  discarded.   The 
analysis  also  cannot  be  carried  out  if  one  tries  to  apply  naming  to  a 
subclause.   For  example,  if  the  clause  C  v  D  is  deduced,  one  might  deduce  by 
some  altered  naming  rule  Ma  v  Da.  Again,  this  will  not  subsume  any  clause 
(  -\   L.  \/-  M).   Thus,  the  naming  rule  achieves  its  advantage  over  adding  all 
the  clauses  associated  with  the  naming  axiom  when  it  is  applied  to  a  whole 
clause  C  and  the  naming  axiom  (C  **  M)  used  satisfies  C  =  Ca. 

In  this  chapter  it  was  shown  that  the  Henkin  axioms  I  could  be  put 
into  a  form  compatible  with  the  naming  axioms  and  the  naming  rule.   Further, 
it  was  shown  that  complete  proof  procedures  can  be  developed  using  resolution 
and  the  naming  axioms.   It  was  pointed  out  that  such  a  procedure  is  not 
efficient  enough  to  be  of  practical  interest.   Finally,  it  was  suggested 
that  the  naming  rule  could  lead  to  practical  algorithms. 
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5.   SUGGESTIONS  FOR  FUTURE  RESEARCH 
The  purpose  of  this  thesis  is  theoretical  in  rtature--to  demon- 
strate that  n-sorted  logic  is  suitable  for  representing  higher-order  logic. 
This  was  accomplished  in  Chapter  3*   This  work  offers  an  alternative  to  the 
use  of  the  \-calculus  for  automating  higher-order  logic.   It  was  not  in- 
tended to  develop  a  computer  program  for  higher-order  logic,  only  to  lay 
the  foundation  for  the  development  of  such  programs  in  the  future.   The 
author  feels  that  the  material  in  Chapter  k   will  be  of  great  use  in  this 
area.   While  no  specific,  practical  algorithm  was  given  in  that  chapter, 
some  examples  of  problems  solved  by  use  of  the  naming  rule  are  given  in  the 
Appendix. 

There  are  two  directions  along  which  future  research  should  pro- 
ceed.  The  first  of  these  is  to  determine  if  resolution  and  the  naming  rule, 
or  some  suitable  restriction  of  it,  is  complete  for  higher-order  logic.   Re- 
call (Section  k.k)   that  when  the  naming  rule  is  applied  in  certain  cases, 
it  is  not  necessary  to  add  a  number  of  the  clauses  representing  the  associa- 
ted naming  axiom  to  the  set  of  clauses  being  generated.   One  would  expect 
that  a  theorem-prover  which  takes  advantage  of  this  to  be  more  efficient 
than  a  theorem-prover  which  merely  adds  all  the  clauses  of  the  associated 
axiom  (Robinson  and  Wbs  [1^]).  Thus,  the  more  efficient  theorem-provers 
using  n-sorted  representations  of  higher-order  logic  will  most  likely  be  the 
ones  which  implement  some  form  of  the  naming  rule.   Then  from  a  practical, 
as  well  as  a  theoretical,  standpoint,  it  is  important  to  know  if  resolution 
and  naming  are  complete. 

The  second  area  for  future  work  is  the  implementation  of  resolution 
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and  naming.  After  resolution  was  introduced  for  first-order  theorem-proving, 
research  was  carried  out  for  several  years  to  develop  strategies  for  the  use 
of  resolution.   That  is,  rules  for  guiding  the  choice  of  pairs  of  clauses  to 
resolve  were  developed;  restrictions  on  when  resolution  could  be  applied  were 
introduced;  strategies  for  eliminating  certain  resolvents  were  used;  and  even 
new  operational  rules  based  on  resolution  were  developed.  All  of  these 
tended  to  increase  the  efficiency  of  resolution  style  first-order  theorem- 
provers.   Similar  work  needs  to  be  done  now  for  resolution  and  naming  for 
higher-order  logic.   Many  of  the  strategies  for  resolution  mentioned  in  the 
introduction  may  simply  be  carried  over  to  resolution  and  naming  theorem- 
provers.   But,  new  strategies  for  handling  the  naming  rule  should  also  be 
developed.  Analogues  of  some  of  the  resolution  strategies  (particularly  set 
of  support)  should  be  considered  as  well  as  entirely  new  strategies  designed 
especially  for  the  naming  rule. 
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APPENDIX 

Some  examples  of  theorems  from  higher-order  logic  and  their 
proofs  in  an  n-sorted  system  will  now  be  given.   These  examples  show  the 
ease  and  naturalness  with  which  n-sorted  logic  can  express  mathematical 
problems.   In  the  last  example,  the  problem  is  stated  directly  in  a  ^-sorted 
logic  without  ever  resorting  to  higher-order  logic.  Also,  the  reader  should 
note  that  in  some  examples,  it  was  not  necessary  to  apply  the  naming  rule. 
Thus,  resolution  alone  is  sufficient  for  some,  and  possibly  many,  higher- 
order  problems. 
Example  1 

If  F  and  G  are  predicates,  their  disjunction  is  also.   That  is 
VxVf  Vg  =i  H(  ( (FX)  v  (GX) )  *>  HX) . 
The  negation  is 

B  X  3  F  3  G  V  H(  ( (FX  v  GX)  *  -;  HX)  v  (HX  *.  -7  FX  ^  -7  GX) ) . 
This  leads  to  the  2-sorted  wff 

3  x  B  f  'd  g  V  h(((p(f,x)  v  P(g,x))  *•  -i  P(h,x)) 

^  (P(h,x)  -  -,  P(f,x)  *■  -,  P(g,x))). 
The  Skolem  normal  form  of  this  is 

Vh(((p(F,a)  v  P(G,a))  ^  -,   P(h,x))  ^  (P(h,a)  *■  -,  p(F,a)  ~  -,   P(G,a))) 
where  F,  G,  and  a  are  constant  symbols.   The  matrix  yields  the  clauses 

1.1.  P(F,a)  ^  P(G,a)  ^  P(h,a) 

1.2.  -,  P(h,a)  v  -,  P(F,a) 

1.3.  -|  P(h,a)  ^  1  P(G,a) 

The  following  is  a  refutation  of  the  denial  of  the  theorem. 
l.k.      P(d,(h),a)  —  name  1.2  using 

(  -1  P(h,x)  v  -,  P(F,x))  *  P(dx(h),x) 
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1.5'      ~i  P(d  (d  (h)),a)   —  take  negation  of  d-.(h)  using 

P(x,y)  *>  -j  P(d2(x),  y) 

1.6.  P(F,a)  v  P(G,a)       —  from  1.1  and  1.5 

1.7.  -1  P(F,a)  --  from  1.2  and  1.4 

1.8.  -7  P(G,a)  ~  from  1.3  and  l.k 

1.9.  P(G,a)  --  from  1.6  and  1.7 
Formulas  1.8  and  1.9  yield  the  empty  clause. 

Example  2 

This  example  is  taken  from  Church  [2].   He  defines  equality  between 
objects,  say  x  =  y,  as  V  f((Fx)  -»  (Fy)).   He  then  proves  some  properties 
about  equality.   Three  of  these  are  given  here. 

A.  V  x(x  =  x),  i.e.   V  x  V  f((Fx)  -»  (Fx)). 

The  negation  is  3  x  3  F((Fx)  /\  -j  (Fx)).   The  2-sorted  translation  in  Skolem 
normal  form  is  P(F,  a)  a  -?  p(F,  a)  where  F  and  a  are  constant  symbols. 
Obviously,  one  resolution  yields  the  empty  clause. 

B.  V  x  V  y(x  =  y  ->  y  =  x),    i.e. 

V  x  V  y( V  f(  (Fx)  ->  (Fy) )  ->  V  F(  (Fy)  -  (Fx)  )  )  . 
The  negation  is 

'd  x  3  y(V  F((Fx)  -  (Fy))  ^  3  F((Fy)  <s  -,  (Fx))). 
The  2-sorted  translation  is 

3x3y(V  f(P(f,x)  -  P(f,y))  -  3  f(P(f,y)  -  ->  P(f,x)). 
The  Skolem  normal  form  is 

V  f((  -,  p(f,a)  v  P(f,b))  -  P(G,b)  a  -,  p(G,a)) 
where  a,  b,  and  G  are  constant  symbols.   The  three  matrix  clauses  are 
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2.B.I.  -i  P(f,a)  *  P(f,b) 
2.B.2.   P(G,b) 
2. BO-   -iP(G,a) 
The  following  is  a  refutation. 

2.B.4.  -|  P(d(G),b)         --  take  negation  of  G  in  2.B.2  using 

P(x,y)  *  -»  P(d(x),y) 
2.B.5*   P(d(G),a)  —  same  as  2.B.^  using  2.B.3  instead 

of  2.B.2 
2.B.6.   p(d(G),b)  —  from  2.B.1  and  2.B.5 

2.B-7.   empty  clause        —  from  2.B.J+  and  2.B.6 
c.  VxVyVz(x=  y^y=  z  ->  x  =  z) 
The  negation  is 

3  x  3y  3  z(Vf((Fx)  -*  (Fy))  >n  V  F((Fy)  -  (Fz))  *  (  J3  F((Fx)  *  -,  (Fz)))). 
The  Skolem  normal  form  of  the  2 -sorted  translation  is 

V  f  V  g((  -,  p(f,a)  v  p(f,b))  *  (  -r  P(g,b)  v  P(g,c))  a  p(F,a)  *  -»  P(F,c)) 
where  a,  b,  c,  and  F  are  constant  symbols.  The  matrix  clauses  are 
2.C.I.  -7  P(f,a)  s/  P(f,b) 


2.C.2 

2.C-3 

2.C.^ 
The  refutation  is 


2.C.5 
2.C.6 

2.C7 


-:  P(g,b)  v  P(g,c) 
P(F,a) 

-1  P(F,c) 


P(F,b) 
P(F,c) 

empty  clause 


—  from  2.C.1  and  2. C.J 

—  from  2.C2  and  2.C-5 
--  from  2.C3  and  2.C.6 

Note  that  in  A  and  C  it  was  unnecessary  to  apply  the  naming  rule.   Resolution 
alone  was  enough  to  derive  a  contradiction. 
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Example  3 

Show  that  the  factor  group  of  a  homomorphism  is  isomorphic  to  the 
image  group.   This  theorem  would  probably  be  handled  as  several  lemmas. 
One  such  lemma  states  that  there  is  a  homomorphism  of  cosets  of  the  kernal 
onto  the  image  of  the  original  homomorphism.   This  lemma  will  now  be  proved. 
It  is  stated  directly  in  a  ^-sorted  system,  rather  than  stated  first  in  the 
\-calculus  and  then  translated. 

There  are  four  types.   Type  1  is  the  domain  group,  type  2  the 
image  group,  type  3  a  set  of  maps  from  type  1  to  type  2,  and  type  k-   is  a  set 
of  subsets  of  type  1  objects.   The  predicate  symbols  are  P  of  type  (l,  1,1), 
Q  of  type  (2,2,2),  R  of  type  (1,3,2),  and  S  of  type  (1,4).   The  function  sym- 
bols are  f  of  type  (2,1),  i  of  type  (1,1),  ip  of  type  (2,2),  and  g,h,  and  k, 
all  of  type  (3,1).   The  clauses  needed  for  the  lemma  and  their  meanings  are 
now  given. 

3-lp.   -i  P(xx,x2,x5)  s/  i  P(x2,x^,x5)  v  -,  P(xx,x5,x6)  v  P(x^,x^,x6) 

3-2p.   -,  P(X;L,x2,x5)  v  -,  P(x2,x^,x5)  v/  -rP(x5,x^,x6)  v  pfx^x  ,xg) 

P(x, y,  z)  means  x-y  =  z.   The  above  two  formulas  express  the  fact  that  type  1 

objects  are  associative. 

3-3p.   P(e1,x,x) 

\        —  e  is  the  identity. 
3-4p.   P(x,ei,x)   j 

3«5p«   P(x,  i,  (x),  e, )        —  i. (x)  is  the  inverse  of  x 

There  is  a  similar  set  of  axioms  3«ln  -  3«5n  using  Q,,  e  and  i  stating  that 
type  2  objects  form  a  group. 

3-  6.   -,  P(x,y,  z)  v  -,  R(x,H,u)  v-  -,  R(y,H,v)  v  -»  R(z,H,w)  v  Q(u,v,w) 

—  R(x,H, u)  means  H(x)  =  u.   This  for- 
mula states  that  H  is  a  homomorphism, 
i.e.  H(x-y)  =  H(x)-H(y) 
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3.  7-   R(f(u),H,u)         --  f(u)  is  an  inverse  of  u  under  H 
3.  8.  n  S(x,N)  s/  R(x,H,e2) 


3.  9-   -i  R(x,H,e2)  v  S(x,N) 


—  N  is  the  kernal  of  H 


3.10.  S(h(j),N)  —  given  a  map  j,  h(,j)  is  an  element 

of  the  kernal  of  H 

3.11.  P(g(j),h(j),k(j)) 

5.12.  R(g(j),j,a) 

3.13.  R(k(j),j,b) 
3.1^.  -»Q(a,e2,l3) 

Formulas  3.11-3.14  express  the  fact  that  given  any  map  j,  g(j)  and  k(j')  are 
in  the  same  coset  of  N,  but  their  images  are  not  equal.   The  refutation 
follows . 

3.15.  R(h(j),H,e2)         —  from  3-8  and  3-10 

3.16.  n  R(g(j),H,u)  v  -,  R(h(j),H,v)  v/  -i  R(k(j),H,w)  s/  Q(u,v,w) 

—  from  3-6  and  3-H 

3.17.  -»  R(h(H),H,v)  v  -,  R(k(H),H,w)  v  Q(a,v,w) 

—  from  3-12  and  3.16 

3.18.  -,R(k(H),H,w)  -  Q(a,e2,w) 

—  from  3-15  and  3-17 

3.19.  Q(a,e2,"b)  —  from  3-13  and  3. 18 

3.20.  empty  clause        —  from  3«1^  and  3«19 
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